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Preface 


This  is  a  text  for  a  two-term  course  in  introductory  real  analysis  for  junior  or  senior  math- 
ematics majors  and  science  students  with  a  serious  interest  in  mathematics.  Prospective 
educators  or  mathematically  gifted  high  school  students  can  also  benefit  from  the  mathe- 
matical maturity  that  can  be  gained  from  an  introductory  real  analysis  course. 

The  book  is  designed  to  fill  the  gaps  left  in  the  development  of  calculus  as  it  is  usually 
presented  in  an  elementary  course,  and  to  provide  the  background  required  for  insight  into 
more  advanced  courses  in  pure  and  applied  mathematics.  The  standard  elementary  calcu- 
lus sequence  is  the  only  specific  prerequisite  for  Chapters  1-5,  which  deal  with  real-valued 
functions.  (However,  other  analysis  oriented  courses,  such  as  elementary  differential  equa- 
tion, also  provide  useful  preparatory  experience.)  Chapters  6  and  7  require  a  working 
knowledge  of  determinants,  matrices  and  linear  transformations,  typically  available  from  a 
first  course  in  linear  algebra.  Chapter  8  is  accessible  after  completion  of  Chapters  1-5. 

Without  taking  a  position  for  or  against  the  current  reforms  in  mathematics  teaching,  I 
think  it  is  fair  to  say  that  the  transition  from  elementary  courses  such  as  calculus,  linear 
algebra,  and  differential  equations  to  a  rigorous  real  analysis  course  is  a  bigger  step  to- 
day than  it  was  just  a  few  years  ago.  To  make  this  step  today's  students  need  more  help 
than  their  predecessors  did,  and  must  be  coached  and  encouraged  more.  Therefore,  while 
striving  throughout  to  maintain  a  high  level  of  rigor,  I  have  tried  to  write  as  clearly  and  in- 
formally as  possible.  In  this  connection  I  find  it  useful  to  address  the  student  in  the  second 
person.  I  have  included  295  completely  worked  out  examples  to  illustrate  and  clarify  all 
major  theorems  and  definitions. 

I  have  emphasized  careful  statements  of  definitions  and  theorems  and  have  tried  to  be 
complete  and  detailed  in  proofs,  except  for  omissions  left  to  exercises.  1  give  a  thorough 
treatment  of  real-valued  functions  before  considering  vector-valued  functions.  In  making 
the  transition  from  one  to  several  variables  and  from  real-valued  to  vector- valued  functions, 
I  have  left  to  the  student  some  proofs  that  are  essentially  repetitions  of  earlier  theorems.  I 
believe  that  working  through  the  details  of  straightforward  generalizations  of  more  elemen- 
tary results  is  good  practice  for  the  student. 

Great  care  has  gone  into  the  preparation  of  the  761  numbered  exercises,  many  with 
multiple  parts.  They  range  from  routine  to  very  difficult.  Hints  are  provided  for  the  more 
difficult  parts  of  the  exercises. 


vi 
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Organization 

Chapter  1  is  concerned  with  the  real  number  system.  Section  1.1  begins  with  a  brief  dis- 
cussion of  the  axioms  for  a  complete  ordered  field,  but  no  attempt  is  made  to  develop  the 
reals  from  them;  rather,  it  is  assumed  that  the  student  is  familiar  with  the  consequences  of 
these  axioms,  except  for  one:  completeness.  Since  the  difference  between  a  rigorous  and 
nonrigorous  treatment  of  calculus  can  be  described  largely  in  terms  of  the  attitude  taken 
toward  completeness,  I  have  devoted  considerable  effort  to  developing  its  consequences. 
Section  1.2  is  about  induction.  Although  this  may  seem  out  of  place  in  a  real  analysis 
course,  I  have  found  that  the  typical  beginning  real  analysis  student  simply  cannot  do  an 
induction  proof  without  reviewing  the  method.  Section  1.3  is  devoted  to  elementary  set  the- 
ory and  the  topology  of  the  real  line,  ending  with  the  Heine-Borel  and  Bolzano-Weierstrass 
theorems. 

Chapter  2  covers  the  differential  calculus  of  functions  of  one  variable:  limits,  continu- 
ity, differentiablility,  L'Hospital's  rule,  and  Taylor's  theorem.  The  emphasis  is  on  rigorous 
presentation  of  principles;  no  attempt  is  made  to  develop  the  properties  of  specific  ele- 
mentary functions.  Even  though  this  may  not  be  done  rigorously  in  most  contemporary 
calculus  courses,  I  believe  that  the  student's  time  is  better  spent  on  principles  rather  than 
on  reestablishing  familiar  formulas  and  relationships. 

Chapter  3  is  to  devoted  to  the  Riemann  integral  of  functions  of  one  variable.  In  Sec- 
tion 3.1  the  integral  is  defined  in  the  standard  way  in  terms  of  Riemann  sums.  Upper  and 
lower  integrals  are  also  defined  there  and  used  in  Section  3.2  to  study  the  existence  of  the 
integral.  Section  3.3  is  devoted  to  properties  of  the  integral.  Improper  integrals  are  studied 
in  Section  3.4.  I  believe  that  my  treatment  of  improper  integrals  is  more  detailed  than  in 
most  comparable  textbooks.  A  more  advanced  look  at  the  existence  of  the  proper  Riemann 
integral  is  given  in  Section  3.5,  which  concludes  with  Lebesgue's  existence  criterion.  This 
section  can  be  omitted  without  compromising  the  student's  preparedness  for  subsequent 
sections. 

Chapter  4  treats  sequences  and  series.  Sequences  of  constant  are  discussed  in  Sec- 
tion 4.1.  I  have  chosen  to  make  the  concepts  of  limit  inferior  and  limit  superior  parts 
of  this  development,  mainly  because  this  permits  greater  flexibility  and  generality,  with 
little  extra  effort,  in  the  study  of  infinite  series.  Section  4.2  provides  a  brief  introduction 
to  the  way  in  which  continuity  and  differentiability  can  be  studied  by  means  of  sequences. 
Sections  4.3-4.5  treat  infinite  series  of  constant,  sequences  and  infinite  series  of  functions, 
and  power  series,  again  in  greater  detail  than  in  most  comparable  textbooks.  The  instruc- 
tor who  chooses  not  to  cover  these  sections  completely  can  omit  the  less  standard  topics 
without  loss  in  subsequent  sections. 

Chapter  5  is  devoted  to  real-valued  functions  of  several  variables.  It  begins  with  a  dis- 
cussion of  the  toplogy  of  M"  in  Section  5.1.  Continuity  and  differentiability  are  discussed 
in  Sections  5.2  and  5.3.  The  chain  rule  and  Taylor's  theorem  are  discussed  in  Section  5.4. 
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Chapter  6  covers  the  differential  calculus  of  vector-valued  functions  of  several  variables. 
Section  6.1  reviews  matrices,  determinants,  and  linear  transformations,  which  are  integral 
parts  of  the  differential  calculus  as  presented  here.  In  Section  6.2  the  differential  of  a 
vector-valued  function  is  defined  as  a  linear  transformation,  and  the  chain  rule  is  discussed 
in  terms  of  composition  of  such  functions.  The  inverse  function  theorem  is  the  subject  of 
Section  6.3,  where  the  notion  of  branches  of  an  inverse  is  introduced.  In  Section  6.4.  the 
implicit  function  theorem  is  motivated  by  first  considering  linear  transformations  and  then 
stated  and  proved  in  general. 

Chapter  7  covers  the  integral  calculus  of  real-valued  functions  of  several  variables.  Mul- 
tiple integrals  are  defined  in  Section  7.1,  first  over  rectangular  parallelepipeds  and  then 
over  more  general  sets.  The  discussion  deals  with  the  multiple  integral  of  a  function  whose 
discontinuities  form  a  set  of  Jordan  content  zero.  Section  7.2  deals  with  the  evaluation  by 
iterated  integrals.  Section  7.3  begins  with  the  definition  of  Jordan  measurability,  followed 
by  a  derivation  of  the  rule  for  change  of  content  under  a  linear  transformation,  an  intuitive 
formulation  of  the  rule  for  change  of  variables  in  multiple  integrals,  and  finally  a  careful 
statement  and  proof  of  the  rule.  The  proof  is  complicated,  but  this  is  unavoidable. 

Chapter  8  deals  with  metric  spaces.  The  concept  and  properties  of  a  metric  space  are 
introduced  in  Section  8.1.  Section  8.2  discusses  compactness  in  a  metric  space,  and  Sec- 
tion 8.3  discusses  continuous  functions  on  metric  spaces. 

Corrections-mathematical  and  typographical-are  welcome  and  will  be  incorporated  when 
received. 

William  F.  Trench 
wtrench@trinity.edu 
Home:  659  Hopkinton  Road 
Hopkinton,  NH  03229 


CHAPTER  1 
The  Real  Numbers 


IN  THIS  CHAPTER  we  begin  the  study  of  the  real  number  system.  The  concepts  discussed 
here  will  be  used  throughout  the  book. 

SECTION  1.1  deals  with  the  axioms  that  define  the  real  numbers,  definitions  based  on 
them,  and  some  basic  properties  that  follow  from  them. 

SECTION  1 .2  emphasizes  the  principle  of  mathematical  induction. 

SECTION  1.3  introduces  basic  ideas  of  set  theory  in  the  context  of  sets  of  real  num- 
bers. In  this  section  we  prove  two  fundamental  theorems:  the  Heine-Borel  and  Bolzano- 
Weierstrass  theorems. 


1.1  THE  REAL  NUMBER  SYSTEM 

Having  taken  calculus,  you  know  a  lot  about  the  real  number  system;  however,  you  prob- 
ably do  not  know  that  all  its  properties  follow  from  a  few  basic  ones.  Although  we  will 
not  carry  out  the  development  of  the  real  number  system  from  these  basic  properties,  it  is 
useful  to  state  them  as  a  starting  point  for  the  study  of  real  analysis  and  also  to  focus  on 
one  property,  completeness,  that  is  probably  new  to  you. 

Field  Properties 

The  real  number  system  (which  we  will  often  call  simply  the  reals)  is  first  of  all  a  set 
{a,  b,  c, . . .}  on  which  the  operations  of  addition  and  multiplication  are  defined  so  that 
every  pair  of  real  numbers  has  a  unique  sum  and  product,  both  real  numbers,  with  the 
following  properties. 

(A)  a  +  b  =  b  +  a  and  ab  =  ba  (commutative  laws). 

(B)  (a  +  b)  +  c  =  a  +  (b  +  c)  and  (ab)c  =  a(bc)  (associative  laws). 

(C)  a(b  +  c)  =  ab  +  ac  (distributive  law). 

(D)  There  are  distinct  real  numbers  0  and  1  such  that  a  +  0  =  a  and  al  =  a  for  all  a. 

(E)  For  each  a  there  is  a  real  number  —a  such  that  a  +  (—a)  =  0,  and  if  a  ^  0,  there  is 
areal  number  l/a  such  thata(l/a)  =  1. 
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The  manipulative  properties  of  the  real  numbers,  such  as  the  relations 

(«  +  b)2  =  a2  +  lab  +  b2, 
(3a  +  2b)(Ac  +  2d)  =  \2ac  +  6ad  +  8fcc  +  Abd. 

(—a)  =  (-l)a,    a(—b)  =  (—a)b  =  —ab. 

and 

a      c      ad  +  be         ,   ,  . 
b  +  d=^d-  (^0)< 

all  follow  from  (A)-(E).  We  assume  that  you  are  familiar  with  these  properties. 

A  set  on  which  two  operations  are  defined  so  as  to  have  properties  ( A)-(E)  is  called  a 
field.  The  real  number  system  is  by  no  means  the  only  field.  The  rational  numbers  (which 
are  the  real  numbers  that  can  be  written  as  r  =  p/q,  where  p  and  q  are  integers  and  q  ^  0) 
also  form  a  field  under  addition  and  multiplication.  The  simplest  possible  field  consists  of 
two  elements,  which  we  denote  by  0  and  1,  with  addition  defined  by 

0  +  0=1  +  1=0,     1+0  =  0+1  =  1,  (1.1.1) 

and  multiplication  defined  by 

0-0  =  0- 1  =  1 -0  =  0,     1-1  =  1  (1.1.2) 

(Exercise  1.1.2). 

The  Order  Relation 

The  real  number  system  is  ordered  by  the  relation  <,  which  has  the  following  properties. 


(F)  For  each  pair  of  real  numbers  a  and  b,  exactly  one  of  the  following  is  true: 

a  =  b,    a  <  b.    or    b  <  a. 

(G)  If  a  <  b  and  b  <  c,  then  a  <  c.  (The  relation  <  is  transitive) 

(H)  If  a  <  b,  then  a  +  c  <  b  +  c  for  any  c,  and  if  0  <  c,  then  ac  <  be. 

A  field  with  an  order  relation  satisfying  (F)-(H)  is  an  ordered  field.  Thus,  the  real 
numbers  form  an  ordered  field.  The  rational  numbers  also  form  an  ordered  field,  but  it  is 
impossible  to  define  an  order  on  the  field  with  two  elements  defined  by  (1.1.1)  and  (1.1.2) 
so  as  to  make  it  into  an  ordered  field  (Exercise  1.1.2). 

We  assume  that  you  are  familiar  with  other  standard  notation  connected  with  the  order 
relation:  thus,  a  >  b  means  that  b  <  a;  a  >  b  means  that  either  a  =  b  or  a  >  b;  a  <  b 
means  that  either  a  =  b  or  a  <  b;  the  absolute  value  of  a,  denoted  by  \a\,  equals  a  if 
a  >  0  or  —a  if  a  <  0.  (Sometimes  we  call  \a\  the  magnitude  of  a.) 

You  probably  know  the  following  theorem  from  calculus,  but  we  include  the  proof  for 
your  convenience. 
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Theorem  1.1.1  (The  Triangle  Inequality)  If  a  and  bare  any  two  real  numbers, 
then 

|a+6|<|c|  +  |6|.  (1.1.3) 
Proof   There  are  four  possibilities: 

(a)  If  a  >  0  and  b  >  0,  then  a  +  b  >  0,  so  \a  +  b\  =  a  +  b  =  \a\  +  \b\. 

(b)  If  a  <  0  and  6  <  0,  thena  +  b  <  0,  so  \a  +  b\  =  -a  +  (-b)  =  \a\  +  \b\. 

(c)  If  a  >  0  and  ft  <  0,  then  a  +  b  =  \a\ -  \b\. 

(d)  If  a  <  0  and  b  >  0,  then  a  +  b  =  —\a\  +  \b\.  Eq.  1.1.3  holds  in  either  case,  since 

\\b\-\a\     if  \b\>\a\, 

The  triangle  inequality  appears  in  various  forms  in  many  contexts.  It  is  the  most  impor- 
tant inequality  in  mathematics.  We  will  use  it  often. 

Corollary  1.1.2  If  a  and  b  are  any  two  real  numbers,  then 

\a-b\>  \\a\-\b\\  (1.1.4) 

and 

\a+b\>\\a\-\b\\.  (1.1.5) 
Proof   Replacing  a  by  a  —  b  in  (1.1.3)  yields 

\a\  <  \a  -  b\  +  \b\, 

so 

\a -b\  >  \a\ -  \b\.  (1.1.6) 

Interchanging  a  and  b  here  yields 

\b-a\  >  \b\ -  \a\, 

which  is  equivalent  to 

\a-b\>\b\-\a\,  (1.1.7) 

since  \b  —  a\  =  \a  —  b\.  Since 

;|o|-|6|   if  |a[>[i|, 
if  |6|>|c|, 


\W\-\b\\  = 


(1.1.6)  and  (1.1.7)  imply  (1.1.4).  Replacing  b  by  -b  in  (1.1.4)  yields  (1.1.5),  since  \  -b\  = 
\b\.  '  TJ 

Supremum  of  a  Set 

A  set  S  of  real  numbers  is  bounded  above  if  there  is  a  real  number  b  such  that  x  <  b 
whenever  x  e  S.  In  this  case,  b  is  an  upper  bound  of  S .  If  b  is  an  upper  bound  of  S, 
then  so  is  any  larger  number,  because  of  property  (G) .  If  f>  is  an  upper  bound  of  S,  but  no 
number  less  than  /3  is,  then  ft  is  a  supremum  of  S,  and  we  write 

P  =  sup  S. 
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With  the  real  numbers  associated  in  the  usual  way  with  the  points  on  a  line,  these  defini- 
tions can  be  interpreted  geometrically  as  follows:  b  is  an  upper  bound  of  S  if  no  point  of  5 
is  to  the  right  of  b;  /3  =  sup  S  if  no  point  of  S  is  to  the  right  of  /3,  but  there  is  at  least  one 
point  of  S  to  the  right  of  any  number  less  than  /3  (Figure  1.1.1). 


p  b 

(S  =  dark  line  segments) 

Figure  1.1.1 

Example  1.1.1  If  S  is  the  set  of  negative  numbers,  then  any  nonnegative  number  is  an 
upper  bound  of  S,  and  sup  5  =  0.  If  Si  is  the  set  of  negative  integers,  then  any  number  a 
such  that  a  >  —  1  is  an  upper  bound  of  Si ,  and  sup  Si  =  —  1 .  ■ 

This  example  shows  that  a  supremum  of  a  set  may  or  may  not  be  in  the  set,  since  Si 
contains  its  supremum,  but  S  does  not. 

A  nonempty  set  is  a  set  that  has  at  least  one  member.  The  empty  set,  denoted  by  0,  is  the 
set  that  has  no  members.  Although  it  may  seem  foolish  to  speak  of  such  a  set,  we  will  see 
that  it  is  a  useful  idea. 

The  Completeness  Axiom 

It  is  one  thing  to  define  an  object  and  another  to  show  that  there  really  is  an  object  that 
satisfies  the  definition.  (For  example,  does  it  make  sense  to  define  the  smallest  positive 
real  number?)  This  observation  is  particularly  appropriate  in  connection  with  the  definition 
of  the  supremum  of  a  set.  For  example,  the  empty  set  is  bounded  above  by  every  real 
number,  so  it  has  no  supremum.  (Think  about  this.)  More  importantly,  we  will  see  in 
Example  1.1.2  that  properties  (A)-(H)  do  not  guarantee  that  every  nonempty  set  that 
is  bounded  above  has  a  supremum.  Since  this  property  is  indispensable  to  the  rigorous 
development  of  calculus,  we  take  it  as  an  axiom  for  the  real  numbers. 

(I)    If  a  nonempty  set  of  real  numbers  is  bounded  above,  then  it  has  a  supremum. 

Property  (I)  is  called  completeness,  and  we  say  that  the  real  number  system  is  a  complete 
ordered  field.  It  can  be  shown  that  the  real  number  system  is  essentially  the  only  complete 
ordered  field;  that  is,  if  an  alien  from  another  planet  were  to  construct  a  mathematical 
system  with  properties  (A)-(I),  the  alien's  system  would  differ  from  the  real  number 
system  only  in  that  the  alien  might  use  different  symbols  for  the  real  numbers  and  +,  •, 
and  <. 

Theorem  1.1.3  If  a  nonempty  set  S  of  real  numbers  is  bounded  above,  then  sup  S  is 
the  unique  real  number  ft  such  that 

(a)  x  <  P  for  all  x  in  S; 

(b)  if  e  >  0  (no  matter  how  small),  there  is  an  Xq  in  S  such  that  Xq  >  P  —  €. 


Section  1 . 1  The  Real  Number  System  5 


Proof  We  first  show  that  /J  =  sup  5  has  properties  (a)  and  (b).  Since  /3  is  an  upper 
bound  of  S,  it  must  satisfy  (a) .  Since  any  real  number  a  less  than  p1  can  be  written  as  j3  —  e 
with  €  =  p1  —  a  >  0,  (b)  is  just  another  way  of  saying  that  no  number  less  than  ft  is  an 
upper  bound  of  S.  Hence,  ft  =  supS  satisfies  (a)  and  (b). 

Now  we  show  that  there  cannot  be  more  than  one  real  number  with  properties  (a)  and 
(b).  Suppose  that  fa  <  fa  and  fa  has  property  (b);  thus,  if  e  >  0,  there  is  an  xo  in  S 
such  that  .To  >  fa  —  e.  Then,  by  taking  e  =  fa  —  fa,  we  see  that  there  is  an  xq  in  5  such 
that 

x0>  fa-(fa-fa)  =  fa, 

so  fa  cannot  have  property  (a).  Therefore,  there  cannot  be  more  than  one  real  number 
that  satisfies  both  (a)  and  (b).  TJ 

Some  Notation 

We  will  often  define  a  set  S  by  writing  S  =  [x  |  ■  •  ■  },  which  means  that  S  consists  of  all 
x  that  satisfy  the  conditions  to  the  right  of  the  vertical  bar;  thus,  in  Example  1.1.1, 

S  =  {x\x<0}  (1.1.8) 

and 

Si  =  {x  |  x  is  a  negative  integer} . 

We  will  sometimes  abbreviate  "x  is  a  member  of  S"  by  x  e  5,  and  "x  is  not  a  member  of 
S"  by  x  £  S.  For  example,  if  S  is  defined  by  (1.1.8),  then 

-1  e  S    but    0  i  S. 
The  Archimedean  Property 

The  property  of  the  real  numbers  described  in  the  next  theorem  is  called  the  Archimedean 
property.  Intuitively,  it  states  that  it  is  possible  to  exceed  any  positive  number,  no  matter 
how  large,  by  adding  an  arbitrary  positive  number,  no  matter  how  small,  to  itself  sufficiently 
many  times. 

Theorem  1.1.4  (Archimedean  Property)  If  p  and  e  are  positive,  then  ne  > 
pfor  some  integer  n . 

Proof  The  proof  is  by  contradiction.  If  the  statement  is  false,  p  is  an  upper  bound  of 
the  set 

S  =  {x\x  =  ne,  n  is  an  integer}  . 
Therefore,  S  has  a  supremum  fa  by  property  (I).  Therefore, 

ne  <  P    for  all  integers  n .  (1-1.9) 
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Since  n  +  1  is  an  integer  whenever  n  is,  (1.1 .9)  implies  that 

(n  +  l)e  <p 

and  therefore 

ne  <  fi  -  e 

for  all  integers  n.  Hence,  —  e  is  an  upper  bound  of  S.  Since  /3  —  e  <  /3,  this  contradicts 
the  definition  of  ft.  H 

Density  of  the  Rationals  and  Irrationals 

Definition  1.1.5  A  set  D  is  dense  in  the  reals  if  every  open  interval  {a,  b)  contains  a 
member  of  D . 

Theorem  1.1.6  The  rational  numbers  are  dense  in  the  reals;  that  is,  if  a  and  b  are 
real  numbers  with  a  <  b,  there  is  a  rational  number  p/q  such  that  a  <  p/q  <  b. 

Proof  From  Theorem  1 . 1 .4  with  p  =  1  and  e  =  b  —  a,  there  is  a  positive  integer  q  such 
that  q(b  —  a)  >  1.  There  is  also  an  integer  j  such  that  j  >  qa.  This  is  obvious  if  a  <  0, 
and  it  follows  from  Theorem  1.1.4  with  6=1  and  p  =  qa  if  a  >  0.  Let  p  be  the  smallest 
integer  such  that  p  >  qa.  Then  p  —  1  <  qa,  so 

qa  <  p  <  qa  +  1. 

Since  1  <  q(b  —  a),  this  implies  that 

qa  <  p  <  qa  +  q(b  —  a)  =  qb, 

so  qa  <  p  <  qb.  Therefore,  a  <  p/q  <  b.  H 

Example  1.1.2  The  rational  number  system  is  not  complete;  that  is,  a  set  of  rational 
numbers  may  be  bounded  above  (by  rationals),  but  not  have  a  rational  upper  bound  less 
than  any  other  rational  upper  bound.  To  see  this,  let 

S  =  {r  |  r  is  rational  and  r2  <  2}  . 

If  r  €  S,  then  r  <  V2.  Theorem  1.1.6  implies  that  if  e  >  0  there  is  a  rational  number  ro 
such  that  y/2  —  e  <  ro  <  V2,  so  Theorem  1.1.3  implies  that  V2  =  supS.  However,  \/2is 
irrational;  that  is,  it  cannot  be  written  as  the  ratio  of  integers  (Exercise  1.1.3).  Therefore, 
if  r\  is  any  rational  upper  bound  of  S,  then  ~J2  <  r\ .  By  Theorem  1.1.6,  there  is  a  rational 
number  r-i  such  that  °J2  <r-i  <r\.  Since  r-i  is  also  a  rational  upper  bound  of  S,  this  shows 
that  5  has  no  rational  supremum.  ■ 

Since  the  rational  numbers  have  properties  (A)-(H),  but  not  (I),  this  example  shows 
that  (I)  does  not  follow  from  (A)-(H). 


Theorem  1.1.7  The  set  of  irrational  numbers  is  dense  in  the  reals ;  that  is,  if  a  and  b 
are  real  numbers  with  a  <  b,  there  is  an  irrational  number  t  such  that  a  <  t  <  b. 
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Proof   From  Theorem  1.1.6,  there  are  rational  numbers  r\  and  r-i  such  that 

a  <  r\  <  r%  <  b. 


(1.1.10) 


t  =  r\  +  —=(r2  -  n). 


Then  t  is  irrational  (why?)  and  r\  <  t  <  r%,  so  a  <  t  <  b,  from  (1.1.10). 


H 


Infimum  of  a  Set 

A  set  S  of  real  numbers  is  bounded  below  if  there  is  a  real  number  a  such  that  x  >  a 
whenever  x  e  S.  In  this  case,  a  is  a  lower  bound  of  S.  If  a  is  a  lower  bound  of  S,  so  is 
any  smaller  number,  because  of  property  (G).  If  a  is  a  lower  bound  of  S,  but  no  number 
greater  than  a  is,  then  a  is  an  infimum  of  S,  and  we  write 

a  =  inf  S. 

Geometrically,  this  means  that  there  are  no  points  of  S  to  the  left  of  a,  but  there  is  at  least 
one  point  of  S  to  the  left  of  any  number  greater  than  a . 

Theorem  1.1.8  If  a  nonempty  set  S  of  real  numbers  is  bounded  below,  then  infS  is 
the  unique  real  number  a  such  that 

(a)  x  >  a  for  all  x  in  S  ; 

(b)  if  €  >  0  (no  matter  how  small ),  there  is  an  Xq  in  S  such  that  Xq  <  a  +  e. 

Proof   (Exercise  1.1.6) 

A  set  S  is  bounded  if  there  are  numbers  a  and  b  such  that  a  <  x  <  b  for  all  x  in  S.  A 
bounded  nonempty  set  has  a  unique  supremum  and  a  unique  infimum,  and 


The  Extended  Real  Number  System 

A  nonempty  set  S  of  real  numbers  is  unbounded  above  if  it  has  no  upper  bound,  or  un- 
bounded below  if  it  has  no  lower  bound.  It  is  convenient  to  adjoin  to  the  real  number 
system  two  fictitious  points,  +oo  (which  we  usually  write  more  simply  as  oo)  and  — oo, 
and  to  define  the  order  relationships  between  them  and  any  real  number  x  by 


inf  S  <  sup  5 


(1.1.11) 


(Exercise  1.1.7). 


—  OO  <  X  <  oo. 


(1.1.12) 


We  call  oo  and  — oo  points  at  infinity.  If  S  is  a  nonempty  set  of  reals,  we  write 

sup  5  =  oo 
to  indicate  that  S  is  unbounded  above,  and 


(1.1.13) 


inf  5  =  — 


(1.1.14) 


to  indicate  that  S  is  unbounded  below. 
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Example  1.1.3  If 

S  =  {x\x  <2}  , 

then  sup  S  =  2  and  inf  5  =  — oo.  If 

S  =  {x  |  x  >  -2}  , 

then  sup  S  =  oo  and  inf  5  =  —2.  If  S  is  the  set  of  all  integers,  then  sup  S  =  oo  and 
infS  =  — oo.  ■ 

The  real  number  system  with  oo  and  — oo  adjoined  is  called  the  extended  real  number 
system,  or  simply  the  extended  reals.  A  member  of  the  extended  reals  differing  from  — oo 
and  oo  is  finite;  that  is,  an  ordinary  real  number  is  finite.  However,  the  word  "finite"  in 
"finite  real  number"  is  redundant  and  used  only  for  emphasis,  since  we  would  never  refer 
to  oo  or  —oo  as  real  numbers. 

The  arithmetic  relationships  among  oo,  — oo,  and  the  real  numbers  are  defined  as  follows. 

(a)    If  a  is  any  real  number,  then 

a  +  oo  =    oo  +  a  =  oo, 

a  —  oo  =  — oo  +  a  =  — oo, 

a  a 

00      — oo 


(b)  lfa>0,  then 

(c)  If  a  <  0,  then 
We  also  define 


a  oo  =  ooa  =  oo, 
a  (—  oo )  =  (— oo)a  =  — oo. 


a  oo  =  ooa  =  —oo, 
a  (— oo )  =  (— oo)a  =  oo. 


oo  +  oo  =  oooo  =  (— oo)(—  oo)  =  oo 

and 

— oo  —  oo  =  oo(— oo)  =  (— oo)oo  =  — oo. 

Finally,  we  define 

|oo|  =  I  —  oo|  =  oo. 

The  introduction  of  oo  and  — oo,  along  with  the  arithmetic  and  order  relationships  defined 
above,  leads  to  simplifications  in  the  statements  of  theorems.  For  example,  the  inequality 
(1.1.11),  first  stated  only  for  bounded  sets,  holds  for  any  nonempty  set  S  if  it  is  interpreted 
properly  in  accordance  with  (1.1.12)  and  the  definitions  of  (1.1.13)  and  (1.1.14).  Exer- 
cises 1.1. 10(b)  and  1.1.1 1(b)  illustrate  the  convenience  afforded  by  some  of  the  arith- 
metic relationships  with  extended  reals,  and  other  examples  will  illustrate  this  further  in 
subsequent  sections. 
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It  is  not  useful  to  define  oo  —  oo,  0  ■  oo,  oo/oo,  and  0/0.  They  are  called  indeterminate 
forms,  and  left  undefined.  You  probably  studied  indeterminate  forms  in  calculus;  we  will 
look  at  them  more  carefully  in  Section  2.4. 

1.1  Exercises 


1.  Write  the  following  expressions  in  equivalent  forms  not  involving  absolute  values, 
(a)  a  +  b  +  \a  -  b\  (b)  a  +  b  -  \a  -  b\ 

(c)  a  +  b  +  2c  +  \a  -  b\  +  \a  +  b -2c  +  \a  -  b\\ 

(d)  a  +  b  +  2c- \a -b\  -  \a  +  b-2c  -  \a -b\\ 

2.  Verify  that  the  set  consisting  of  two  members,  0  and  1,  with  operations  defined  by 
Eqns.  (1.1.1)  and  (1.1.2),  is  a  field.  Then  show  that  it  is  impossible  to  define  an  order 
<  on  this  field  that  has  properties  (F),  (G),  and  (H). 

3.  Show  that  a/2  is  irrational.  Hint:  Show  that  if  \[2  =  m/n,  where  m  and  n  are 
integers,  then  both  m  and  n  must  be  even.  Obtain  a  contradiction  from  this. 

4.  Show  that  ^fp  is  irrational  if  p  is  prime. 

5.  Find  the  supremum  and  infimum  of  each  5.  State  whether  they  are  in  S . 


(a) 

S  = 

{*l 

x  =  -(1/n)  +  [1  +  (- 

-l)n]n2,n  > 

(b) 

S  = 

\x2  <  9} 

(c) 

s  = 

x2  <  7} 

(d) 

s  = 

{* 

\\2x  +  1|  <  5} 

(e) 

s  = 

{*\ 

(x2  +            >  1} 

(f) 

s  = 

l*\ 

x  =  rational  and  x2  < 

7} 

6.  Prove  Theorem  1.1.8.  Hint:  The  set  T  =  {x  |  -ieS)  is  bounded  above  if  S  is 
bounded  below.  Apply  property  (I)  and  Theorem  1.1.3  to  T. 

7.  (a)   Show  that 

inf  S  <  sup  S  (A) 

for  any  nonempty  set  5  of  real  numbers,  and  give  necessary  and  sufficient 
conditions  for  equality, 
(b)    Show  that  if  S  is  unbounded  then  (A)  holds  if  it  is  interpreted  according  to 
Eqn.  ( 1 . 1 . 1 2)  and  the  definitions  of  Eqns.  ( 1 . 1 . 1 3)  and  ( 1 . 1 . 14). 

8.  Let  5  and  T  be  nonempty  sets  of  real  numbers  such  that  every  real  number  is  in  S 
or  T  and  if  s  e  S  and  t  €  T,  then  s  <  t .  Prove  that  there  is  a  unique  real  number  ft 
such  that  every  real  number  less  than  ft  is  in  S  and  every  real  number  greater  than 
P  is  in  T.  (A  decomposition  of  the  reals  into  two  sets  with  these  properties  is  a 
Dedekind  cut.  This  is  known  as  Dedekind's  theorem.) 
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9.  Using  properties  (A)-(H)  of  the  real  numbers  and  taking  Dedekind's  theorem 
(Exercise  1.1.8)  as  given,  show  that  every  nonempty  set  U  of  real  numbers  that  is 
bounded  above  has  a  supremum.  Hint:  Let  T  be  the  set  of  upper  bounds  ofU  and 
S  be  the  set  of  real  numbers  that  are  not  upper  bounds  ofU. 

10.  Let  S  and  T  be  nonempty  sets  of  real  numbers  and  define 

S  +  T  =  {s  +  t  |  s  €  S,  t  €  T) . 

(a)  Show  that 

sup(5  +  T)  =  sup  S  +  sup  T  (A) 
if  S  and  T  are  bounded  above  and 

inf(S  +  T)  =  inf  S  +  inf  T  (B) 

if  S  and  T  are  bounded  below. 

(b)  Show  that  if  they  are  properly  interpreted  in  the  extended  reals,  then  (A)  and 
(B)  hold  if  S  and  T  are  arbitrary  nonempty  sets  of  real  numbers. 

1 1 .  Let  S  and  T  be  nonempty  sets  of  real  numbers  and  define 

S-T  =  {s-t  \s  €  S,t  €  T}. 

(a)  Show  that  if  S  and  T  are  bounded,  then 

sup(S  —  T)  =  sup  5  —  inf  T  (A) 

and 

inf (5  -  T)  =  inf  S  -  sup  T.  (B) 

(b)  Show  that  if  they  are  properly  interpreted  in  the  extended  reals,  then  (A)  and 
(B)  hold  if  S  and  T  are  arbitrary  nonempty  sets  of  real  numbers. 

12.  Let  5  be  a  bounded  nonempty  set  of  real  numbers,  and  let  a  and  b  be  fixed  real 
numbers.  Define  T  =  {as  +  b\s  6  S}.  Find  formulas  for  sup  T  and  inf  T  in  terms 
of  sup  S  and  inf  5.  Prove  your  formulas. 

1.2  MATHEMATICAL  INDUCTION 

If  a  flight  of  stairs  is  designed  so  that  falling  off  any  step  inevitably  leads  to  falling  off  the 
next,  then  falling  off  the  first  step  is  a  sure  way  to  end  up  at  the  bottom.  Crudely  expressed, 
this  is  the  essence  of  the  principle  of  mathematical  induction:  If  the  truth  of  a  statement 
depending  on  a  given  integer  n  implies  the  truth  of  the  corresponding  statement  with  n 
replaced  by  n  +  1,  then  the  statement  is  true  for  all  positive  integers  n  if  it  is  true  for  n  =  1. 
Although  you  have  probably  studied  this  principle  before,  it  is  so  important  that  it  merits 
careful  review  here. 

Peano's  Postulates  and  Induction 

The  rigorous  construction  of  the  real  number  system  starts  with  a  set  N  of  undefined  ele- 
ments called  natural  numbers,  with  the  following  properties. 
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(A)  N  is  nonempty. 

(B)  Associated  with  each  natural  number  n  there  is  a  unique  natural  number  n'  called 
the  successor  of  n. 

(C)  There  is  a  natural  number  n  that  is  not  the  successor  of  any  natural  number. 

(D)  Distinct  natural  numbers  have  distinct  successors;  that  is,  \fn^m,  then  n'  ^  m' . 

(E)  The  only  subset  of  N  that  contains  n  and  the  successors  of  all  its  elements  is  N 
itself. 

These  axioms  are  known  as  Pecmo's  postulates.  The  real  numbers  can  be  constructed 
from  the  natural  numbers  by  definitions  and  arguments  based  on  them.  This  is  a  formidable 
task  that  we  will  not  undertake.  We  mention  it  to  show  how  little  you  need  to  start  with  to 
construct  the  reals  and,  more  important,  to  draw  attention  to  postulate  (E),  which  is  the 
basis  for  the  principle  of  mathematical  induction. 

It  can  be  shown  that  the  positive  integers  form  a  subset  of  the  reals  that  satisfies  Peano's 
postulates  (with  n  =  1  and  n'  =  n  +  1),  and  it  is  customary  to  regard  the  positive  integers 
and  the  natural  numbers  as  identical.  From  this  point  of  view,  the  principle  of  mathematical 
induction  is  basically  a  restatement  of  postulate  (E). 

Theorem  1.2.1  (Principle  of  Mathematical  Induction)  Let  Pi,  P2,-- , 
Pn,  . . .  be  propositions,  one  for  each  positive  integer,  such  that 

(a)  P\  is  true; 

(b)  for  each  positive  integer  n,  Pn  implies  Pn+\- 
Then  P„  is  true  for  each  positive  integer  n. 

Proof  Let 

M  =  [n  |  n  E  N  and  P„  is  true}  . 

From  (a),  1  E  M,  and  from  (b),  n  +  1  E  M  whenever  n  £  M.  Therefore,  M  =  N,  by 
postulate  (E).  TJ 

Example  1.2.1  Let  P„  be  the  proposition  that 

n(n  +  1) 

1  +2  +  ---  +  W  =       -      .  (1.2.1) 

Then  Pi  is  the  proposition  that  1=1,  which  is  certainly  true.  If  Pn  is  true,  then  adding 
n  +  1  to  both  sides  of  (1.2.1)  yields 

n(n  +  1) 

(1  +  2  +  •••  +  «)  +  («  +  1)  =  '  +  (n  +  1) 

=  («  +  !)(!  +  !) 

_  (n  +  \)(n  +  2) 
2 

or 

(n  +  l)(n  +  2) 
1  +  2  +  •■•  +  («+  1)  =  '"2        ' , 
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which  is  Pn+\,  since  ithas  the  form  of  (1.2.1),  with  n  replaced  by«  +  l.  Hence,  Pn  implies 
Pn+i,  so  (1.2.1)  is  true  for  all  n,  by  Theorem  1.2.1.  ■ 

A  proof  based  on  Theorem  1.2.1  is  an  induction  proof,  or  proof  by  induction.  The 
assumption  that  Pn  is  true  is  the  induction  assumption.  (Theorem  1.2.3  permits  a  kind  of 
induction  proof  in  which  the  induction  assumption  takes  a  different  form.) 

Induction,  by  definition,  can  be  used  only  to  verify  results  conjectured  by  other  means. 
Thus,  in  Example  1.2.1  we  did  not  use  induction  to  find  the  sum 

Sn  =  1  +  2  H  \-n\  (1-2.2) 

rather,  we  verified  that 

n(n  +  1) 

s„  =    -  2    -■  (1.2.3) 

How  you  guess  what  to  prove  by  induction  depends  on  the  problem  and  your  approach  to 
it.  For  example,  (1.2.3)  might  be  conjectured  after  observing  that 

1-2  2-3  4-3 

Si  =  1  =   ,      S2  =  3  =   ,      S3  =  6  =   . 

2  2  2 

However,  this  requires  sufficient  insight  to  recognize  that  these  results  are  of  the  form 
(1.2.3)  for  n  =  1,  2,  and  3.  Although  it  is  easy  to  prove  (1.2.3)  by  induction  once  it  has 
been  conjectured,  induction  is  not  the  most  efficient  way  to  find  sn,  which  can  be  obtained 
quickly  by  rewriting  (1.2.2)  as 

s„  =  n  +  (n-  1)  H  hi 

and  adding  this  to  (1.2.2)  to  obtain 

2s„  =  [n  +  1]  +  [(»  -  1)  +  2]  +  •••  +  [1  +  n]. 

There  are  n  bracketed  expressions  on  the  right,  and  the  terms  in  each  add  up  to  n  +  1; 
hence, 

2sn  =  n(n  +  1), 

which  yields  (1.2.3). 

The  next  two  examples  deal  with  problems  for  which  induction  is  a  natural  and  efficient 
method  of  solution. 

Example  1.2.2  Letai  =  1  and 

1 

a„+i  =  -a„,    n  >  1  (1-2.4) 

n  +  1 

(we  say  that  an  is  defined  inductively),  and  suppose  that  we  wish  to  find  an  explicit  formula 
for  an .  By  considering  n  =  1,2,  and  3,  we  find  that 

1  1  1 

a\  =  — ,    fl2  =   •     and  a% 


1  1-2  1-2-3 
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and  therefore  we  conjecture  that 

a„  =  -i  (1.2.5) 
This  is  given  for  n  =  1 .  If  we  assume  it  is  true  for  some  n,  substituting  it  into  ( 1 .2.4)  yields 

1     1  1 
°n+1  ~~  n  +  \~n\  ~  (n  +  1)!' 

which  is  (1.2.5)  with  n  replaced  by  n  +  1.  Therefore,  (1.2.5)  is  true  for  every  positive 
integer  n,  by  Theorem  1.2.1. 

Example  1.2.3  For  each  nonnegative  integer  n,  let  xn  be  a  real  number  and  suppose 
that 

\xn+i  ~x„\  <  r\xn-xn-i\,    n>\,  (1.2.6) 
where  r  is  a  fixed  positive  number.  By  considering  (1.2.6)  for  n  =  1,2,  and  3,  we  find  that 

\x2 -xi\  <  r\x\  -  x0\, 

1*3  -  x2\  <  r\x2 -Xi\  <  r2\xi  -  x0\, 

\xa  -  x3\  <  r\x3  -  x2\  <  r3\xi  -  x0\. 

Therefore,  we  conjecture  that 

\xn-Xn-il  <  r"~Vi -x0\    if    n>\.  (1.2.7) 

This  is  trivial  for  n  =  1.  If  it  is  true  for  some  n,  then  (1.2.6)  and  (1.2.7)  imply  that 

\xn+\ -x„\  <  r(rn_1|xi --Xol),    so    \xn+i  -  x„\  <  rn\x\  -  x0|, 

which  is  proposition  (1.2.7)  with  n  replaced  by  n  +  1.  Hence,  (1.2.7)  is  true  for  every 
positive  integer  n,  by  Theorem  1.2.1.  ■ 

The  major  effort  in  an  induction  proof  (after  P\ ,  P2,  . . . ,  Pn,  ...  have  been  formulated) 
is  usually  directed  toward  showing  that  Pn  implies  P„+\ .  However,  it  is  important  to  verify 
Pi,  since  Pn  may  imply  Pn+\  even  if  some  or  all  of  the  propositions  P\,  P2,  . . . ,  Pn,  . . . 
are  false. 

Example  1.2.4  Let  Pn  be  the  proposition  that  2n  —  1  is  divisible  by  2.  If  Pn  is  true 
then  Pn+\  is  also,  since 

2n  +  1  =  {In  -  1)  +  2. 
However,  we  cannot  conclude  that  Pn  is  true  for  n  >  1 .  In  fact,  P„  is  false  for  every  n .  ■ 

The  following  formulation  of  the  principle  of  mathematical  induction  permits  us  to  start 
induction  proofs  with  an  arbitrary  integer,  rather  than  1,  as  required  in  Theorem  1.2.1. 
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Theorem  1.2.2  Let  no  be  any  integer  (positive,  negative,  or  zero).  Let  P„Q,  P„Q+i, 
... ,  Pn,  ...  be  propositions,  one  for  each  integer  n  >  no,  such  that 

(a)  Pno  is  true ; 

(b)  for  each  integer  n  >  no,  Pn  implies  Pn+\- 
Then  Pn  is  true  for  every  integer  n  >  no- 
Proof  Form  >  1,  let  Qm  be  the  proposition  defined  by  Qm  =  Pm+„0-i.  Then  Q\  = 
P„0  is  true  by  (a).  If  m  >  1  and  Qm  =  Pm+„Q-i  is  true,  then  Qm+\  =  Pm+„Q  is  true  by 
(b)  with«  replaced  by  m  +  «o  —  1-  Therefore,  Qm  is  true  for  all  aw  >  1  by  Theorem  1.2.1 
with  P  replaced  by  Q  and  n  replaced  by  m.  This  is  equivalent  to  the  statement  that  Pn  is 
true  for  all  n  >  no-  H 

Example  1.2.5  Consider  the  proposition  P„  that 

3n  +  16  >  0. 

If  Pn  is  true,  then  so  is  Pn+\,  since 

3(«  +  1)  +  16=  3n  +  3+  16 

=  (3«  +  16)  +  3>0+3  (by  the  induction  assumption) 
>  0. 

The  smallest  no  for  which  Pno  is  true  is  no  =  —5.  Hence,  Pn  is  true  for  n  >  —5,  by 
Theorem  1.2.2. 

Example  1.2.6  Let  P„  be  the  proposition  that 

n\-3n  >  0. 

If  Pn  is  true,  then 

(n  +  1)!  -  3"+1  =  «!(«  +  1)  -  3"+1 

>  3n(n  +  1)  —  3"+1    (by  the  induction  assumption) 
=  3"(»-2). 

Therefore,  Pn  implies  Pn+\  if  n  >  2.  By  trial  and  error,  no  =  7  is  the  smallest  integer 
such  that  PnQ  is  true;  hence,  Pn  is  true  for  n  >  7,  by  Theorem  1.2.2.  ■ 

The  next  theorem  is  a  useful  consequence  of  the  principle  of  mathematical  induction. 

Theorem  1.2.3  Let  no  be  any  integer  (positive,  negative,  or  zero).  Let  P„0,  P„Q+i,..., 
Pn,  . . .  be  propositions,  one  for  each  integer  n  >  no,  such  that 

(a)  Pno  is  true ; 

(b)  for  n  >  no,  Pn+\  is  true  if  PnQ,  Pno+i,. . . ,  Pn  are  all  true. 
Then  P„  is  true  for  n  >  no- 
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Proof  For  n  >  no,  let  Qn  be  the  proposition  that  P„Q,  Pno+\, . . .,  Pn  are  all  true.  Then 
2„0  is  true  by  (a).  Since  Qn  implies  Pn+\  by  (b),  and  Qn+i  is  true  if  Qn  and  .P„+i  are 
both  true,  Theorem  1.2.2  implies  that  Qn  is  true  for  all  n  >  uq.  Therefore,  P„  is  true  for 
all  n  >  no-  H 

Example  1.2.7  An  integer  p  >  1  is  a  prime  if  it  cannot  be  factored  as  p  =  rs  where 
r  and  s  are  integers  and  1  <  r,  s  <  p.  Thus,  2,  3,  5,  7,  and  1 1  are  primes,  and,  although  4, 
6,  8,  9,  and  10  are  not,  they  are  products  of  primes: 

4  =  2-2.    6=2-3,    8  =  2-2-2,    9=3-3,     10  =  2-5. 

These  observations  suggest  that  each  integer  n  >  2  is  a  prime  or  a  product  of  primes.  Let 
this  proposition  be  Pn.  Then  P2  is  true,  but  neither  Theorem  1.2.1  nor  Theorem  1.2.2 
apply,  since  Pn  does  not  imply  Pn+i  in  any  obvious  way.  (For  example,  it  is  not  evident 
from  24  =  2  ■  2  •  2  •  3  that  25  is  a  product  of  primes.)  However,  Theorem  1.2.3  yields  the 
stated  result,  as  follows.  Suppose  that  n  >  2  and  P^,  Pn  are  true.  Either  n  +  1  is  a 
prime  or 

n  +  1  =  rs,  (1.2.8) 

where  r  and  s  are  integers  and  1  <  r,  s  <  n,  so  Pr  and  Ps  are  true  by  assumption.  Hence,  r 
and  s  are  primes  or  products  of  primes  and  ( 1 .2.8)  implies  that  n  +  1  is  a  product  of  primes. 
We  have  now  proved  Pn+\  (that  n  +  1  is  a  prime  or  a  product  of  primes).  Therefore,  P „  is 
true  for  all  n  >  2,  by  Theorem  1.2.3. 


1.2  Exercises 


Prove  the  assertions  in  Exercises  1.2.1-1.2.6  by  induction. 


1.  The  sum  of  the  first  n  odd  integers  is  n 

2.  l2  +  22  +  ---  +  «2  = 


2l.2|        |2      n(n  +  l)(2n  +  1) 


6 

i       9  ,  ->      n(4n2  —  1) 

3.  l2  +  32  +  ---  +  (2«-l)2  =  -i—  -. 

4.  If  ai ,  a2,  ■  ■  ■ ,  <2«  are  arbitrary  real  numbers,  then 

[fli  +  ai  H  hflnl  <  |ai|  +  [02!  H  \-\an\. 

5.  If  a,  >  0,  i  >  1,  then 

(1  +  ai)(l  +a2)---(l  +  a«)  >  1  +  ai  +  -22  H  ha„. 

6.  IfO<a,  <  l,z  >  l,then 

(1  -ai)(l  -a2)---(l  —a„)  >  l-a\-a2  an. 
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7.  Suppose  that  so  >  0  and  s„  =  1  —  e  Sn~',  n  >  1.  Show  that  0  <  s„  <  1,  n  >  1. 

8.  Suppose  that     >  0,  xo  >  0,  and 

1  (  R  \ 

Xn  +  l  =  x  h  x„    ,     n  >  0. 

2  \*n  / 

Prove:  For  n  >  1,  x„  >  x„+i  >  \/7?  and 

1  (x0-VR)2 


x„  -  Vr  < 


2"  x0 


9.    Find  and  prove  by  induction  an  explicit  formula  for  an\ici\  =  1  and,  for  n  >  1, 
(a)  a„+i  =  - — — — — — —  (b)  an+\ 


(n  +  l)(2n  +1)  v  '  (2n  +  2)(2n  +  3) 

,  v  2n  +  1  ,  /  1\" 

(c)  a„+1  =   — -a„  (d)  an+i  =    1  +  -  a„ 

n  +  i  \       n  J 

10.  Let  a\  =  0  and  a„+\  =  (n  +  \)an  for  n  >  1,  and  let  f„  be  the  proposition  that 
an  =  n\ 

(a)  Show  that  P„  implies  Pn+\  ■ 

(b)  Is  there  an  integer  n  for  which  P„  is  true? 

1 1 .  Let  Pn  be  the  proposition  that 

(»  +  2)(»  -  1) 

1  +  2  H  \-n  =  . 

2 

(a)  Show  that  Pn  implies  Pn+\. 

(b)  Is  there  an  integer  n  for  which  Pn  is  true? 

12.  For  what  integers  n  is 

18" 

—  >   ? 

«!  (2m)! 

Prove  your  answer  by  induction. 

13.  Let  a  be  an  integer  >  2. 

(a)  Show  by  induction  that  if  n  is  a  nonnegative  integer,  then  n  =  aq  +  r,  where 
q  (quotient)  and  r  (remainder)  are  integers  and  0  <  r  <  a. 

(b)  Show  that  the  result  of  (a)  is  true  if  n  is  an  arbitrary  integer  (not  necessarily 
nonnegative). 

(c)  Show  that  there  is  only  one  way  to  write  a  given  integer  n  in  the  form  n  = 
aq  +  r,  where  q  and  r  are  integers  and  0  <  r  <  a. 

14.  Take  the  following  statement  as  given:  If  p  is  a  prime  and  a  and  b  are  integers  such 
that  p  divides  the  product  ab,  then  p  divides  a  orb. 
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(a)  Prove:  If  p,  pi, pk  are  positive  primes  and  p  divides  the  product  p\---  pk, 
then  p  =  pi  for  some  i  in  {1, ... ,  k). 

(b)  Let  7i  be  an  integer  >  1.  Show  that  the  prime  factorization  of  n  found  in 
Example  1.2.7  is  unique  in  the  following  sense:  If 

n=pi---pr    and    n  =  qiq2-  ■  -qs, 

where  p\, pT,  q\, qs  are  positive  primes,  then  r  =  s  and  {q\, . . .  ,qr} 
is  a  permutation  of  {p\, . . . ,  pr}. 

15.  Let  «i  =  02  =  5  and 

an+i  =  an  +  6a„_!,    n  >  2. 
Show  by  induction  that  a„  =  3"  -  (-2)"  if  n  >  1. 

16.  Let  <2i  =  2,  <22  =  0,  «3  =  —14,  and 

an+i  =  9an  -  23a„_i  +  15a„„2.    «  >  3. 
Show  by  induction  that  an  =  3""1  -  5""1  +  2,  n  >  1. 

17.  The  Fibonacci  numbers  {Fn}'^_1  are  defined  by  F\  =  F-i  =  1  and 

=  ^«  +  n  >  2. 


Prove  by  induction  that 


_,      (1  +  V5)«  -  (1  -  V5)B 

F„  =   =  .      71  >  1. 

2"V5 


18.     Prove  by  induction  that 


I 


yn{l-y)rdy  = 


(r  +  l)(r  +2)---(r  +«  +  1) 
if  n  is  a  nonnegative  integer  and  r  >  —  1 . 

19.     Suppose  that  m  and  71  are  integers,  with  0  <  m  <  n.  The  binomial  coefficient 
is  the  coefficient  of  tm  in  the  expansion  of  (1  +  i)n\  that  is, 


'  n 


<>+«)" =t(:V 

m=0  V  / 


m=0 

From  this  definition  it  follows  immediately  that 


'  n  \  In 
io)  =  \« 


=  1,    n  >  0. 


For  convenience  we  define 

I  n   \  In, 

.  =  0.     71  >  0. 

.-1    /      \«  +  l 
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(a)   Show  that 


"  +  1    =  ("    +  (    n     ),  0<m<n, 


and  use  this  to  show  by  induction  on  n  that 
I  n\  n\ 


0  <  m  <  n. 


\m  I      ml(n  —  m)\ 
(b)   Show  that 

£(-.rh=0    and  tfj=2. 


I      /  — '  \  171  . 

m=0  \    /  m=0  \  / 

(c)    Show  that 

(x  +  yf  =  J2  (n)xmyn-m. 

m=0  \) 

(This  is  the  binomial  theorem.) 

20.  Use  induction  to  find  an  «th  antiderivative  of  log  x,  the  natural  logarithm  of  x. 

21.  Let  fi(x\)  =  gi(xi)  =  x\.  For«  >  2,  let 

f„(xi,x2,  ...,xn)  =  fn-1(x1,x2,  ■  ■  ■  ,xn-i)  +  2n~2xn  + 

\fn-l(Xl  ,  X2 ,  ■  ■  ■  ,  Xn—\ )  —  2      xn  I 

and 

gn(Xl,X2,  ...,X„)  =  gn-1(xUX2,  ■  ■  -,X„-i)  +  2"~2X„  - 

\gn-i(xi,x2, xn-i)  -  2n~2xn\. 

Find  explicit  formulas  for  fn(x\,  x2, . . . ,  xn)  and  gn (x i ,  x2 , . . . ,  xn ) . 

22.  Prove  by  induction  that 

1  —  COS  2  TlX 

sinx  +  sin 3x  H  +  sin(2n  —  l)x  =   ,     n  >  1. 

2  sin^: 

HINT:  You  will  need  trigonometric  identities  that  you  can  derive  from  the  identities 

cosL4  —  B)  =  cos  A  cos  B  +  sin  A  sin  B, 
cosL4  +  B)  =  cos  A  cos  B  —  sin  A  sin  B, 

Take  these  two  identities  as  given. 
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23.     Suppose  that  ci\  <  a2  <  •  ■  •  <  a„  and  b\  <  b2  <  ■  ■  ■  <  bn.  Let  {£i,i2, ...€„}  be  a 
permutation  of  {1,2, ...,«},  and  define 

n 
i  =  l 

Show  that 

e(^,€2,...,4)  >  2(1,2,. ..,«). 

1.3  THE  REAL  LINE 

One  of  our  objectives  is  to  develop  rigorously  the  concepts  of  limit,  continuity,  differen- 
tiability, and  integrability,  which  you  have  seen  in  calculus.  To  do  this  requires  a  better 
understanding  of  the  real  numbers  than  is  provided  in  calculus.  The  purpose  of  this  section 
is  to  develop  this  understanding.  Since  the  utility  of  the  concepts  introduced  here  will  not 
become  apparent  until  we  are  well  into  the  study  of  limits  and  continuity,  you  should  re- 
serve judgment  on  their  value  until  they  are  applied.  As  this  occurs,  you  should  reread  the 
applicable  parts  of  this  section.  This  applies  especially  to  the  concept  of  an  open  covering 
and  to  the  Heine-Borel  and  Bolzano-Weierstrass  theorems,  which  will  seem  mysterious  at 
first. 

We  assume  that  you  are  familiar  with  the  geometric  interpretation  of  the  real  numbers  as 
points  on  a  line.  We  will  not  prove  that  this  interpretation  is  legitimate,  for  two  reasons:  (1) 
the  proof  requires  an  excursion  into  the  foundations  of  Euclidean  geometry,  which  is  not 
the  purpose  of  this  book;  (2)  although  we  will  use  geometric  terminology  and  intuition  in 
discussing  the  reals,  we  will  base  all  proofs  on  properties  (A)-(I)  (Section  1.1)  and  their 
consequences,  not  on  geometric  arguments. 

Henceforth,  we  will  use  the  terms  real  number  system  and  real  line  synonymously  and 
denote  both  by  the  symbol  R;  also,  we  will  often  refer  to  a  real  number  as  a  point  (on  the 
real  line). 

Some  Set  Theory 

In  this  section  we  are  interested  in  sets  of  points  on  the  real  line;  however,  we  will  consider 
other  kinds  of  sets  in  subsequent  sections.  The  following  definition  applies  to  arbitrary 
sets,  with  the  understanding  that  the  members  of  all  sets  under  consideration  in  any  given 
context  come  from  a  specific  collection  of  elements,  called  the  universal  set.  In  this  section 
the  universal  set  is  the  real  numbers. 

Definition  1.3.1  Let  S  and  T  be  sets. 

(a)  5  contains  T,  and  we  write  5  D  T  or  T  C  S,  if  every  member  of  T  is  also  in  S.  In 
this  case,  T  is  a  subset  of  S. 

(b)  5  —  T  is  the  set  of  elements  that  are  in  S  but  not  in  T. 

(c)  5  equals  T,  and  we  write  S  =  T,  if  5  contains  T  and  T  contains  5;  thus,  5  =  T  if 
and  only  if  5  and  T  have  the  same  members. 
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(d)  S  strictly  contains  T  if  S  contains  T  but  T  does  not  contain  5;  that  is,  if  every 
member  of  T  is  also  in  5,  but  at  least  one  member  of  S  is  not  in  T  (Figure  1.3.1). 

(e)  The  complement  of  S,  denoted  by  Sc ,  is  the  set  of  elements  in  the  universal  set  that 
are  not  in  5 . 

(f )  The  union  of  S  and  T,  denoted  by  5  U  T,  is  the  set  of  elements  in  at  least  one  of  S 
and  T  (Figure  1.3.1(b)). 

(g)  The  intersection  of  S  and  T,  denoted  by  S  fl  T,  is  the  set  of  elements  in  both  S  and 
T  (Figure  1.3.1(c)).  If  S  fl  T  =  0  (the  empty  set),  then  S  and  T  are  disjoint  sets 
(Figure  1.3.1(d)). 

(h)  A  set  with  only  one  member  xq  is  a  singleton  set,  denoted  by  {xq}. 


S  n  T  =  shaded  region  5  n  r  =  0 

(c)  (d) 

Figure  1.3.1 


Example  1.3.1  Let 

S  =  {jc|0<x<1},     T  =  {x\0<x<l  and  x  is  rational}  , 

and 

U  =  {x  |  0  <  x  <  1  and  x  is  irrational}  . 

Then  S  D  T  and  S  D  U,  and  the  inclusion  is  strict  in  both  cases.  The  unions  of  pairs  of 
these  sets  are 

S  U  T  =  S,    S  U  U  =  S,    and    T  U  U  =  S, 
and  their  intersections  are 

S  n  T  =  T,     S  nU  =  U,     and    T  (1  U  =  0. 
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Also, 

S  -  U  =  T    and    S  -  T  =  U.  ■ 

Every  set  S  contains  the  empty  set  0,  for  to  say  that  0  is  not  contained  in  5  is  to  say  that 
some  member  of  0  is  not  in  5,  which  is  absurd  since  0  has  no  members.  If  S  is  any  set, 
then 

(Sc)c  =  S    and    S  n  Sc  =  0. 
If  S  is  a  set  of  real  numbers,  then  S  U  Sc  =  BL 

The  definitions  of  union  and  intersection  have  generalizations:  If  3>  is  an  arbitrary  col- 
lection of  sets,  then  u{S|Se.F}is  the  set  of  all  elements  that  are  members  of  at  least 
one  of  the  sets  in  3? ,  and  fl  {S  |  S  €  is  the  set  of  all  elements  that  are  members  of  every 
set  in  .  The  union  and  intersection  of  finitely  many  sets  S\,  . . . ,  S„  are  also  written  as 
Uik=i  $k  and  flik=i  The  union  and  intersection  of  an  infinite  sequence  {Sk}kxL1  of  sets 
are  written  as  (JfcLi  $k  and  DjS=i  $k- 

Example  1.3.2  If  "F  is  the  collection  of  sets 

Sp  =  {x  |  p  <  x  <  1  +  p) ,    0  <  p  <  1/2, 

then 

{J{Sp\Sp&F}={x\0<x<  3/2}  and  f]  {Sp  \  Sp  e       =  {x  \  1/2  <  x  <  1} . 

Example  1.3.3  If,  for  each  positive  integer  k,  the  set  Sk  is  the  set  of  real  numbers 
that  can  be  written  as  i  =  mjk  for  some  integer  m,  then  Ufe=i  $k  is  the  set  of  rational 
numbers  and  HfcLi  ^  ^s  me  set  °f  integers. 


Open  and  Closed  Sets 

If  a  and  b  are  in  the  extended  reals  and  a  <  b,  then  the  open  interval  (a,b)  is  defined  by 

(a,b)  =  {x  |  a  <  x  <  b}  . 

The  open  intervals  (a,  oo)  and  (— oo,  b)  are  semi-infinite  if  a  and  are  finite,  and  (— oo,  oo) 
is  the  entire  real  line. 

Definition  1.3.2  If  xq  is  a  real  number  and  e  >  0,  then  the  open  interval  (xq— e,  Xo  +  e) 
is  an  e-neighborhood  of  xq.  If  a  set  5  contains  an  e-neighborhood  of  xq,  then  S  is  a 
neighborhood  of  xo,  and  xq  is  an  interior  point  of  5  (Figure  1.3.2).  The  set  of  interior 
points  of  5  is  the  interior  of  S,  denoted  by  5°.  If  every  point  of  S  is  an  interior  point  (that 
is,  S°  =  S),  then  5  is  open.  A  set  S  is  closed  if  Sc  is  open. 
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xo-<  xo  xo  +  e 
— <  •  )  


5  =  four  line  segments 
xQ  =  interior  point  of  5 

Figure  1.3.2 


The  idea  of  neighborhood  is  fundamental  and  occurs  in  many  other  contexts,  some  of 
which  we  will  see  later  in  this  book.  Whatever  the  context,  the  idea  is  the  same:  some  defi- 
nition of  "closeness"  is  given  (for  example,  two  real  numbers  are  "close"  if  their  difference 
is  "small"),  and  a  neighborhood  of  a  point  .To  is  a  set  that  contains  all  points  sufficiently 
close  to  xq. 

Example  1.3.4  An  open  interval  (a,b)  is  an  open  set,  because  if  xq  e  (a,b)  and 
e  <  min{xo  —  a,b  —  xq},  then 

Oo  -  €,  xQ  +  e)  c  (a,b). 

The  entire  line  K  =  (— oo,  oo)  is  open,  and  therefore  0  (=  Re)  is  closed.  However,  0  is 
also  open,  for  to  deny  this  is  to  say  that  0  contains  a  point  that  is  not  an  interior  point, 
which  is  absurd  because  0  contains  no  points.  Since  0  is  open,  R  (=  0C)  is  closed.  Thus, 
R  and  0  are  both  open  and  closed.  They  are  the  only  subsets  of  R  with  this  property 
(Exercise  1.3.18).  ■ 

A  deleted  neighborhood  of  a  point  .To  is  a  set  that  contains  every  point  of  some  neigh- 
borhood of  xq  except  for  xq  itself.  For  example, 

S  =  [x  |  0  <  \x  —  xq\  <  e) 

is  a  deleted  neighborhood  of  xq.  We  also  say  that  it  is  a  deleted  e -neighborhood  of  xq. 

Theorem  1.3.3 

(a)  The  union  of  open  sets  is  open. 

(b)  The  intersection  of  closed  sets  is  closed. 

These  statements  apply  to  arbitrary  collections,  finite  or  infinite,  of  open  and  closed  sets. 

Proof    (a)  Let  ~§  be  a  collection  of  open  sets  and 

S  =  U{G\G  . 

If  xq  e  S,  then  xq  e  Go  for  some  Go  in  ~§ ,  and  since  Go  is  open,  it  contains  some  e- 
neighborhood  of  xo-  Since  Go  C  S,  this  e -neighborhood  is  in  S,  which  is  consequently  a 
neighborhood  of  xq.  Thus,  S  is  a  neighborhood  of  each  of  its  points,  and  therefore  open, 
by  definition. 

(b)  Let  f  be  a  collection  of  closed  sets  and  T  =  C\  {F  \  F  €  3*  }.  Then  Tc  = 
U  {Fc  J  F  e  (Exercise  1.3.7)  and,  since  each  Fc  is  open,  Tc  is  open,  from  (a).  There- 
fore, T  is  closed,  by  definition.  H 
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Example  1.3.5  If  — oo  <  a  <  b  <  oo,  the  set 

[a,  b]  =  {x  |  a  <  x  <  b} 

is  closed,  since  its  complement  is  the  union  of  the  open  sets  (— oo,  a)  and  (b,  oo).  We  say 
that  [a,  b]  is  a  closed  interval.  The  set 

[a,  b)  =  [x  |  a  <  x  <  b} 

is  a  half-closed  or  half-open  interval  if  — oo  <  a  <  b  <  oo,  as  is 

(a,     =  {.r  |  a  <  x  <  ; 

however,  neither  of  these  sets  is  open  or  closed.  (Why  not?)  Semi-infinite  closed  intervals 
are  sets  of  the  form 

[a,  oo)  =  {x  |  a  <  x}     and    (— oo,  a]  =  {x  |  x  <  a}  , 

where  a  is  finite.  They  are  closed  sets,  since  their  complements  are  the  open  intervals 
(— oo,  a)  and  (a,  oo),  respectively.  ■ 

Example  1.3.4  shows  that  a  set  may  be  both  open  and  closed,  and  Example  1.3.5  shows 
that  a  set  may  be  neither.  Thus,  open  and  closed  are  not  opposites  in  this  context,  as  they 
are  in  everyday  speech. 

Example  1.3.6  From  Theorem  1.3.3  and  Example  1.3.4,  the  union  of  any  collection  of 
open  intervals  is  an  open  set.  (In  fact,  it  can  be  shown  that  every  nonempty  open  subset  of 
K  is  the  union  of  open  intervals.)  From  Theorem  1.3.3  and  Example  1.3.5,  the  intersection 
of  any  collection  of  closed  intervals  is  closed.  ■ 

It  can  be  shown  that  the  intersection  of  finitely  many  open  sets  is  open,  and  that  the 
union  of  finitely  many  closed  sets  is  closed.  However,  the  intersection  of  infinitely  many 
open  sets  need  not  be  open,  and  the  union  of  infinitely  many  closed  sets  need  not  be  closed 
(Exercises  1.3.8  and  1.3.9). 

Definition  1.3.4  Let  S  be  a  subset  of  Jt.  Then 

(a)  .To  is  a  limit  point  of  S  if  every  deleted  neighborhood  of  xq  contains  a  point  of  5. 

(b)  xq  is  a  boundary  point  of  5  if  every  neighborhood  of  .To  contains  at  least  one  point 
in  S  and  one  not  in  S .  The  set  of  boundary  points  of  S  is  the  boundary  of  5,  denoted 
by  dS.  The  closure  of  S,  denoted  by  S,  is  S  =  S  U  35. 

(c)  xq  is  an  isolated  point  of  S  if  xq  g  S  and  there  is  a  neighborhood  of  .To  that  contains 
no  other  point  of  S . 

(d)  xq  is  exterior  to  S  if  xq  is  in  the  interior  of  Sc .  The  collection  of  such  points  is  the 
exterior  of  S . 


Example  1.3.7  Let  S  =  (-oo,  -1]  U  (1,2)  U  {3}.  Then 
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(a)  The  set  of  limit  points  of  S  is  (—00,  —1]  U  [1,2]. 

(b)  as  =  {-1, 1,2,  3}  and  S  =  (-00,  -1]  U  [1,2]  U  {3}. 

(c)  3  is  the  only  isolated  point  of  S. 

(d)  The  exterior  of  S  is  (-1,  1)  U  (2,3)  U  (3,  00). 


Example  1.3.8  For«  >  1,  let 


In  = 


1 


1 


In  +  1  In 


and 


s=  U  v 


H  =  l 


Then 


(a)  The  set  of  limit  points  of  S  is  S  U  {0}. 

(b)  dS={x\x  =  Oorx  =  1/n  («  >  2)}  and  S  =  S  U  {0}. 

(c)  5  has  no  isolated  points. 

(d)  The  exterior  of  S  is 


(-00,  0)  U 


00 


+  2  In  +  1 


u 


00 


Example  1.3.9  Let  S  be  the  set  of  rational  numbers.  Since  every  interval  contains  a 
rational  number  (Theorem  1.1.6),  every  real  number  is  a  limit  point  of  S;  thus,  S  =  R. 
Since  every  interval  also  contains  an  irrational  number  (Theorem  1 . 1 .7),  every  real  number 
is  a  boundary  point  of  5;  thus  35  =  K.  The  interior  and  exterior  of  S  are  both  empty,  and 
S  has  no  isolated  points.  S  is  neither  open  nor  closed.  ■ 

The  next  theorem  says  that  S  is  closed  if  and  only  if  S  =  S  (Exercise  1.3.14). 

Theorem  1.3.5  A  set  S  is  closed  if  and  only  if  no  point  of  Sc  is  a  limit  point  of  S. 

Proof  Suppose  that  S  is  closed  and  xq  e  Sc .  Since  Sc  is  open,  there  is  a  neighborhood 
of  *o  that  is  contained  in  Sc  and  therefore  contains  no  points  of  S.  Hence,  xq  cannot  be  a 
limit  point  of  S.  For  the  converse,  if  no  point  of  Sc  is  a  limit  point  of  S  then  every  point  in 
Sc  must  have  a  neighborhood  contained  in  Sc .  Therefore,  Sc  is  open  and  S  is  closed.  TJ 

Theorem  1.3.5  is  usually  stated  as  follows. 

Corollary  1.3.6  A  set  is  closed  if  and  only  if  it  contains  all  its  limit  points. 

Theorem  1.3.5  and  Corollary  1.3.6  are  equivalent.  However,  we  stated  the  theorem  as 
we  did  because  students  sometimes  incorrectly  conclude  from  the  corollary  that  a  closed 
set  must  have  limit  points.  The  corollary  does  not  say  this.  If  S  has  no  limit  points,  then 
the  set  of  limit  points  is  empty  and  therefore  contained  in  S.  Hence,  a  set  with  no  limit 
points  is  closed  according  to  the  corollary,  in  agreement  with  Theorem  1.3.5.  For  example, 
any  finite  set  is  closed.  More  generally,  S  is  closed  if  there  is  a  S  >  0  such  \x  —  y  \  >  S  for 
every  pair  {x,  y}  of  distinct  points  in  S. 
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Open  Coverings 

A  collection  3i  of  open  sets  is  an  open  covering  of  a  set  S  if  every  point  in  5  is  contained 
in  a  set  H  belonging  to  M;  that  is,  if  S  C  U  {H  \  H  £  Jf). 

Example  1.3.10  The  sets 

Si  =  [0,1],  S2  =  {1,2, ...,«,...}, 
S3  =  \l, \ ,     and    S4  =  (0,  1) 


2  n 

are  covered  by  the  families  of  open  intervals 


#3 


1  1 


n  +  i  n  -  i 


0  <  x  <  1  j  ,  (A^=  positive  integer), 
n  =  1,2,...]  , 

n  =  1.2.. 


and 


^4  =  {(0,p)|0<p<  1}, 
respectively. 

Theorem  1.3.7  (Heine  Borel  Theorem)  If  M  is  an  open  covering  of  a  closed 
and  bounded  subset  S  of  the  real  line,  then  S  has  an  open  covering  M  consisting  of  finitely 
many  open  sets  belonging  to  3i . 

Proof  Since  5  is  bounded,  it  has  an  infimum  a  and  a  supremum  /3,  and,  since  5  is 
closed,  a  and  ft  belong  to  S  (Exercise  1.3.17).  Define 

St  =  S  n  [a,  t]    for  t  >  a, 

and  let 

F  =  {t  \  a  <  t  <  [3  and  finitely  many  sets  from  M  cover  St}  . 

Since  Sp  =  S,  the  theorem  will  be  proved  if  we  can  show  that  /3  e  F.  To  do  this,  we  use 
the  completeness  of  the  reals. 

Since  a  €  S,  Sa  is  the  singleton  set  {a},  which  is  contained  in  some  open  set  Ha  from 
3t  because  3i  covers  S;  therefore,  a  e  F .  Since  F  is  nonempty  and  bounded  above  by  /3, 
it  has  a  supremum  y.  First,  we  wish  to  show  that  y  =  p1.  Since  y  <  f3  by  definition  of  F , 
it  suffices  to  rule  out  the  possibility  that  y  <  fi-  We  consider  two  cases. 
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Case  1.  Suppose  that  y  <  p1  and  y  $  S.  Then,  since  5  is  closed,  y  is  not  a  limit  point 
of  S  (Theorem  1.3.5).  Consequently,  there  is  an  e  >  0  such  that 

[y  -e,  y  +  e]  n  S  =  0, 

However,  the  definition  of  y  implies  that  SY-e  has  a  finite  subcovering 
from  3t ,  while  Sy+e  does  not.  This  is  a  contradiction. 

Case  2.  Suppose  that  y  <  /3  and  y  e  S.  Then  there  is  an  open  set  Hy  in  M  that 
contains  y  and,  along  with  y,  an  interval  [y  —  e,  y  +  e]  for  some  positive  e.  Since  Sy-e  has 
a  finite  covering  {Hi  ,...,//„}  of  sets  from  ,  it  follows  that  Sy+e  has  the  finite  covering 
{Hi, . . . ,  Hn,  Hy}.  This  contradicts  the  definition  of  y. 

Now  we  know  that  y  =  fj,  which  is  in  5.  Therefore,  there  is  an  open  set  Hp  in  M  that 
contains  p1  and  along  with  p1,  an  interval  of  the  form  [p1  —  e,  /3  +  e],  for  some  positive  e. 
Since  S^-^  is  covered  by  a  finite  collection  of  sets  {Hi, . . . ,  H^},  Sp  is  covered  by  the 
finite  collection  {Hi,...,  H^,  Hp}.  Since  Sp  =  5,  we  are  finished.  TJ 

Henceforth,  we  will  say  that  a  closed  and  bounded  set  is  compact.  The  Heine-Borel 
theorem  says  that  any  open  covering  of  a  compact  set  S  contains  a  finite  collection  that 
also  covers  S.  This  theorem  and  its  converse  (Exercise  1.3.21)  show  that  we  could  just 
as  well  define  a  set  S  of  reals  to  be  compact  if  it  has  the  Heine-Borel  property;  that  is,  if 
every  open  covering  of  S  contains  a  finite  subcovering.  The  same  is  true  of  R",  which  we 
study  in  Section  5.1.  This  definition  generalizes  to  more  abstract  spaces  (called  topological 
spaces)  for  which  the  concept  of  boundedness  need  not  be  defined. 

Example  1.3.11  Since  Si  in  Example  1.3.10  is  compact,  the  Heine-Borel  theorem 
implies  that  Si  can  be  covered  by  a  finite  number  of  intervals  from  3ti .  This  is  easily  veri- 
fied, since,  for  example,  the  7.N  intervals  from  3t\  centered  at  the  points  =  kj 2N  (0  < 
k  <  2N  —  1)  cover  Sx. 

The  Heine-Borel  theorem  does  not  apply  to  the  other  sets  in  Example  1.3.10  since  they 
are  not  compact:  S2  is  unbounded  and  S3  and  S4  are  not  closed,  since  they  do  not  contain 
all  their  limit  points  (Corollary  1.3.6).  The  conclusion  of  the  Heine-Borel  theorem  does 
not  hold  for  these  sets  and  the  open  coverings  that  we  have  given  for  them.  Each  point  in 
S2  is  contained  in  exactly  one  set  from  M2,  so  removing  even  one  of  these  sets  leaves  a 
point  of  S2  uncovered.  If  ^3  is  any  finite  collection  of  sets  from  M3,  then 

-  <?U{H  I  H  e  J{3) 

for  n  sufficiently  large.  Any  finite  collection  {(0,  pi), . . . ,  (0,  pn)}  from  Ma  covers  only  the 
interval  (0,  pmdx),  where 

Pmax  =  max{pi,...,p„}  <  1. 
The  Bolzano  Weierstrass  Theorem 

As  an  application  of  the  Heine-Borel  theorem,  we  prove  the  following  theorem  of  Bolzano 
and  Weierstrass. 
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Theorem  1.3.8  (Bolzano  Weierstrass  Theorem)  Every  bounded  infinite  set 
of  real  numbers  has  at  least  one  limit  point. 

Proof  We  will  show  that  a  bounded  nonempty  set  without  a  limit  point  can  contain  only 
a  finite  number  of  points.  If  S  has  no  limit  points,  then  S  is  closed  (Theorem  1.3.5)  and 
every  point  x  of  S  has  an  open  neighborhood  Nx  that  contains  no  point  of  S  other  than  x . 
The  collection 

M  =  {Nx\xe  S) 

is  an  open  covering  for  S.  Since  S  is  also  bounded,  Theorem  1.3.7  implies  that  S  can  be 
covered  by  a  finite  collection  of  sets  from  3i,  say  NXI,  . . . ,  NXn.  Since  these  sets  contain 
only  x\,       xn  from  S,  it  follows  that  S  =  {x\, , , , ,  xn  }.  H 

1.3  Exercises 


1.  Find  5  n  T,  (S  n  T)c,  Sc  DTC,SU  T,  (S  U  T  f ,  and  Sc  U  Tc. 

(a)  5  =  (0, 1),  T  =  [|,  §]     (b)  S  =  {x\x2>  4},  T  =  {x  \  x2  <  9) 

(c)  S  =  (-oo,  oo),  T  =  0     (d)  S  =  (-oo,-l),  T  =  (1,  oo) 

2.  Let  Sk  =  (1  -  l/k,2+  l/k],k  >  1.  Find 

OO  OO  OO  OO 

(a)Ufc    (b)fV*    (C)IKC  wn^ 

k=l  k=l  k=l  k=l 

3.  Prove:  If  A  and  5  are  sets  and  there  is  a  set  X  such  that  A  U  X  =  B  U  X  and 
y4  n  X  =  B  n  X,  then  A  =  B. 

4.  Find  the  largest  e  such  that  5  contains  an  e-neighborhood  of  xq. 
(a)x0=!,S  =  [i,l)       (b)x0  =  |,S  =  [I  I] 

(c)  x0  =  5,  S  =  (-1, oo)     (d)  x0  =  1,  S  =  (0,2) 

5.  Describe  the  following  sets  as  open,  closed,  or  neither,  and  find  5°,  (Se)°,  and 
(S°)e. 

(a)  S  =  (-1,2)  U  [3,  oo)     (b)  S  =  (-oo,  1)  U  (2,  oo) 
(c)  S  =  [-3,  -2]  U  [7,  8]     (d)S={x\x  =  integer} 

6.  Prove  that  (5  n  T)c  =  Sc  U  Tc  and  (S  U  T)e  =  Sc  n  Te. 

7.  Let  !F  be  a  collection  of  sets  and  define 

/  =  n  {F  |  F  €  and     [/  =  U  {F  \  F  e  . 

Prove  that  (a)  7C  =  U  {Fc  |  F  e       and  (b)  C/c  =  {nF  |  F  e  F}. 

8.  (a)    Show  that  the  intersection  of  finitely  many  open  sets  is  open. 
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(b)    Give  an  example  showing  that  the  intersection  of  infinitely  many  open  sets 
may  fail  to  be  open. 

9.     (a)    Show  that  the  union  of  finitely  many  closed  sets  is  closed. 

(b)    Give  an  example  showing  that  the  union  of  infinitely  many  closed  sets  may 
fail  to  be  closed. 

10.  Prove: 

(a)  If  U  is  a  neighborhood  of  xo  and  U  C  V,  then  V  is  a  neighborhood  of  xq. 

(b)  If  U\,  . . . ,  Un  are  neighborhoods  of  xo,  so  is  (~]"=l  Uj. 

11.  Find  the  set  of  limit  points  of  5,  35,  5,  the  set  of  isolated  points  of  5,  and  the 
exterior  of  5. 

(a)  5  =  (-oo,-2)U(2,3)U{4}U(7,oo) 

(b)  5  =  {all  integers} 

(c)  5  =  U  {(«,  n  +  1)  |  n  =  integer} 

(d)  5  =  {x\x  =  \/n,n  =  1,2,3,...} 

12.  Prove:  A  limit  point  of  a  set  5  is  either  an  interior  point  or  a  boundary  point  of  5. 

13.  Prove:  An  isolated  point  of  S  is  a  boundary  point  of  Sc . 

14.  Prove: 

(a)  A  boundary  point  of  a  set  5  is  either  a  limit  point  or  an  isolated  point  of  5. 

(b)  A  set  5  is  closed  if  and  only  if  5  =  S. 

15.  Prove  or  disprove:  A  set  has  no  limit  points  if  and  only  if  each  of  its  points  is 
isolated. 

16.  (a)   Prove:  If  S  is  bounded  above  and  fi  =  sup  S,  then  /}  €  35. 
(b)    State  the  analogous  result  for  a  set  bounded  below. 

17.  Prove:  If  5  is  closed  and  bounded,  then  inf  5  and  sup  5  are  both  in  5. 

18.  If  a  nonempty  subset  5  of  R  is  both  open  and  closed,  then  5  =  R. 

19.  Let  5  be  an  arbitrary  set.  Prove:  (a)  35  is  closed,  (b)  5°  is  open,  (c)  The  exterior 
of  5  is  open,  (d)  The  limit  points  of  5  form  a  closed  set.  (e)  (5)  =  5. 

20.  Give  counterexamples  to  the  following  false  statements. 

(a)  The  isolated  points  of  a  set  form  a  closed  set. 

(b)  Every  open  set  contains  at  least  two  points. 

(c)  If  5i  and  52  are  arbitrary  sets,  then  3(5i  U  52)  =  35i  U  352- 

(d)  If  5i  and  52  are  arbitrary  sets,  then  3(5i  n  52)  =  35i  n  352. 

(e)  The  supremum  of  a  bounded  nonempty  set  is  the  greatest  of  its  limit  points. 

(f )  If  5  is  any  set,  then  3(35)  =  35. 

(g)  If  5  is  any  set,  then  35  =  35. 

(h)  If  5i  and  52  are  arbitrary  sets,  then  (5i  U  52)°  =  5j>  U  5°. 
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21.  Let  5  be  a  nonempty  subset  of  R  such  that  if  3t  is  any  open  covering  of  5,  then  5 
has  an  open  covering  X  comprised  of  finitely  many  open  sets  from  M.  Show  that 
S  is  compact. 

22.  A  set  S  is.  in  a  set  T  if  5  C  T  C  S. 

(a)  Prove:  If  S  and  T  are  sets  of  real  numbers  and  S  C  T,  then  S  is  dense  in  T 
if  and  only  if  every  neighborhood  of  each  point  in  T  contains  a  point  from  S. 

(b)  State  how  (a)  shows  that  the  definition  given  here  is  consistent  with  the  re- 
stricted definition  of  a  dense  subset  of  the  reals  given  in  Section  1.1. 


23.  Prove: 


(a)  (Si  n  S2)°  =  S°  n  S2° 


(b)  s°  u  s2°  c  (Si  u  s2y 


24.  Prove: 


(a)  3(Si  U  S2)  C  aSi  U  3S2 

(c)  as  c  as 

(e)  d(S-T)  cdSUdT 


(b)  3(Si  n  S2)  C  dSi  U  3S2 
(d)  3S  =  3SC 


CHAPTER  2 


Differential  Calculus  of 
Functions  of  One  Variable 


IN  THIS  CHAPTER  we  study  the  differential  calculus  of  functions  of  one  variable. 

SECTION  2.1  introduces  the  concept  of  function  and  discusses  arithmetic  operations  on 
functions,  limits,  one-sided  limits,  limits  at  ±00,  and  monotonic  functions. 

SECTION  2.2  defines  continuity  and  discusses  removable  discontinuities,  composite  func- 
tions, bounded  functions,  the  intermediate  value  theorem,  uniform  continuity,  and  addi- 
tional properties  of  monotonic  functions. 

SECTION  2.3  introduces  the  derivative  and  its  geometric  interpretation.  Topics  covered  in- 
clude the  interchange  of  differentiation  and  arithmetic  operations,  the  chain  rule,  one-sided 
derivatives,  extreme  values  of  a  differentiable  function,  Rolle's  theorem,  the  intermediate 
value  theorem  for  derivatives,  and  the  mean  value  theorem  and  its  consequences. 

SECTION  2.4  presents  a  comprehensive  discussion  of  L'Hospital's  rule. 

SECTION  2.5  discusses  the  approximation  of  a  function  /  by  the  Taylor  polynomials  of 
/  and  applies  this  result  to  locating  local  extrema  of  /.  The  section  concludes  with  the 
extended  mean  value  theorem,  which  implies  Taylor's  theorem. 

2.1  FUNCTIONS  AND  LIMITS 

In  this  section  we  study  limits  of  real-valued  functions  of  a  real  variable.  You  studied 
limits  in  calculus.  However,  we  will  look  more  carefully  at  the  definition  of  limit  and  prove 
theorems  usually  not  proved  in  calculus. 

A  rule  /  that  assigns  to  each  member  of  a  nonempty  set  D  a  unique  member  of  a  set  Y 
is  a  function  from  D  to  Y .  We  write  the  relationship  between  a  member  x  of  D  and  the 
member  y  of  Y  that  /  assigns  to  x  as 

y  =  fM- 

The  set  D  is  the  domain  of  /,  denoted  by  Df.  The  members  of  Y  are  the  possible  values 
of  /.  If  yo  e  Y  and  there  is  an  xq  in  D  such  that  / (xo)  =  yo  then  we  say  that  /  attains 
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or  assumes  the  value  yo.  The  set  of  values  attained  by  /  is  the  range  of  /.  A  real-valued 
function  of  a  real  variable  is  a  function  whose  domain  and  range  are  both  subsets  of  the 
reals.  Although  we  are  concerned  only  with  real-valued  functions  of  a  real  variable  in  this 
section,  our  definitions  are  not  restricted  to  this  situation.  In  later  sections  we  will  consider 
situations  where  the  range  or  domain,  or  both,  are  subsets  of  vector  spaces. 

Example  2.1.1  The  functions  /,  g,  and  h  defined  on  (— oo,  oo)  by 

f(x)  =  x2,    g(x)  =  sinx,    and    h(x)  =  ex 
have  ranges  [0,  oo),  [—1,  1],  and  (0,  oo),  respectively. 

Example  2.1.2  The  equation 

[f(x)]2  =  x  (2.1.1) 

does  not  define  a  function  except  on  the  singleton  set  {0}.  If  x  <  0,  no  real  number  satisfies 
(2.1.1),  while  if  x  >  0,  two  real  numbers  satisfy  (2.1.1).  However,  the  conditions 

[f(x)]2  =  x    and  /(jc)>0 

define  a  function  /  on  D  f  =  [0,  oo)  with  values  / (x)  =  *fx.  Similarly,  the  conditions 

[g(x)]2  =  x    and     g(x)  <  0 

define  a  function  g  on  Dg  =  [0,  oo)  with  values  g(x)  =  —*J~x.  The  ranges  of  /  and  g  are 
[0,  oo)  and  (— oo,  0],  respectively.  ■ 

It  is  important  to  understand  that  the  definition  of  a  function  includes  the  specification 
of  its  domain  and  that  there  is  a  difference  between  /,  the  name  of  the  function,  and  / (x), 
the  value  of  /  at  x.  However,  strict  observance  of  these  points  leads  to  annoying  verbosity, 
such  as  "the  function  /  with  domain  (— oo,  oo)  and  values  / (x)  =  x."  We  will  avoid  this 
in  two  ways:  (1)  by  agreeing  that  if  a  function  /  is  introduced  without  explicitly  defining 
D  f,  then  D y  will  be  understood  to  consist  of  all  points  x  for  which  the  rule  defining 
/ (x)  makes  sense,  and  (2)  by  bearing  in  mind  the  distinction  between  /  and  / (x),  but  not 
emphasizing  it  when  it  would  be  a  nuisance  to  do  so.  For  example,  we  will  write  "consider 
the  function  f(x)  =  Vl  —  x2,"  rather  than  "consider  the  function  /  defined  on  [—1,  1] 
by  f(x)  =  Vl  —  x2,"  or  "consider  the  function  g(x)  =  1/ sinx,"  rather  than  "consider 
the  function  g  defined  for  x  ^  kit  (k  =  integer)  by  g(x)  =1/  sinx."  We  will  also  write 
f  =  c  (constant)  to  denote  the  function  /  defined  by  f(x)  =  c  for  all  x. 

Our  definition  of  function  is  somewhat  intuitive,  but  adequate  for  our  purposes.  More- 
over, it  is  the  working  form  of  the  definition,  even  if  the  idea  is  introduced  more  rigorously 
to  begin  with.  For  a  more  precise  definition,  we  first  define  the  Cartesian  product  X  x  Y 
of  two  nonempty  sets  X  and  Y  to  be  the  set  of  all  ordered  pairs  (x,  y)  such  that  x  e  X  and 
y  €  Y;  thus, 

X  x  Y  =  {(x,y)  |  x  €  X,  y  e  Y)  . 
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A  nonempty  subset  /  of  X  x  Y  is  a  function  if  no  x  in  X  occurs  more  than  once  as  a  first 
member  among  the  elements  of  /.  Put  another  way,  if  (x,  y)  and  (x,  y\)  are  in  /,  then 
y  =  ji.  The  set  of  x's  that  occur  as  first  members  of  /  is  the  of  /.  If  x  is  in  the  domain 
of  /,  then  the  unique  y  in  Y  such  that  (x,  y)  e  /  is  the  value  of  f  at  x,  and  we  write 
y  =  f  (x).  The  set  of  all  such  values,  a  subset  of  Y,  is  the  range  of  f. 

Arithmetic  Operations  on  Functions 

Definition  2.1.1  If  Df  r\Dg  ^  0,  then  f  +  g,  f  -g,  and  fg  are  defined  on  Dfr\Dg 
by 

(/  +  g)(x)  =  f{x)  +  g{x), 
(f  -g)(x)  =  f(x)-g(x), 

and 

(fg)(x)  =  f(x)g(x). 

The  quotient  f/g  is  defined  by 


for  x  in  D f  fl  £)ff  such  that  ^(x)  ^  0. 

Example  2.1.3  If  /(x)  =  V4-jc2  and  g(jt)  =  V^7!,  then  D/  =  [-2,2]  and 
Dg  =  [1,  oo),  so  /  +  g,  f  —  g,  and  /g  are  defined  on  D f  fl  Z)g  =  [1,2]  by 

(/  +  g)(x)  =  V4-*2  +  V^T, 

and     

(fg)(x)  =  (VT^XV^T)  =  V(4-*2)(*-l).  (2.1.2) 

The  quotient  f/g  is  defined  on  (1,  2]  by 

Although  the  last  expression  in  (2.1.2)  is  also  defined  for  — oo  <  x  <  —2,  it  does  not 
represent  fg  for  such  x,  since  /  and  g  are  not  defined  on  (— oo,  —2]. 

Example  2.1.4  If  c  is  a  real  number,  the  function  cf  defined  by  (cf)(x)  =  cf(x)can 
be  regarded  as  the  product  of  /  and  a  constant  function.  Its  domain  is  D  f .  The  sum  and 
product  of  n  (>  2)  functions  f\,       fn  are  defined  by 

(fi  +  fi  +  •••  +  /„)(*)  =  fi(x)  +  h(x)  +  •••  +  fn(x) 
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and 

CA/2 -/»)(*)  =  /1W/2W-AW  (2.1.3) 

on  Z)  =  nf=i  ^ /  >  provided  that  Z)  is  nonempty.  If  /1  =  fz  =  ■  ■  ■  =  fn>  then  (2.1.3) 
defines  the  nth  power  of  f : 

(/")«  =  (/«)"■ 
From  these  definitions,  we  can  build  the  set  of  all  polynomials 

p(x)  =  ao  +  a\X  +  1-  a„xn, 

starting  from  the  constant  functions  and  f(x)  =  x.  The  quotient  of  two  polynomials  is  a 
rational  function 

ao  +  a\x  H  V  anxn 

b0  +  b\x  H  h  dmxm 

The  domain  of  r  is  the  set  of  points  where  the  denominator  is  nonzero. 


Limits 

The  essence  of  the  concept  of  limit  for  real-valued  functions  of  a  real  variable  is  this:  If  L 
is  a  real  number,  then  limx^xo  / (x)  =  L  means  that  the  value  / (x)  can  be  made  as  close 
to  L  as  we  wish  by  taking  x  sufficiently  close  to  Xo-  This  is  made  precise  in  the  following 
definition. 


x0-5  x0  x0  +  5 


Figure  2.1.1 
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Definition  2.1.2  We  say  that  f(x)  approaches  the  limit  L  as  x  approaches  xq,  and 
write 

lim  f(x)  =  L, 

x^>x0 

if  /  is  defined  on  some  deleted  neighborhood  of  xq  and,  for  every  e  >  0,  there  is  a  S  >  0 
such  that 

\f(x)-L\<e  (2.1.4) 

if 

0  <  |x  —  x0|  <  <5.  (2.1.5) 
Figure  2.1.1  depicts  the  graph  of  a  function  for  which  limx^xo  / (x)  exists. 

Example  2.1.5  If  c  and  x  are  arbitrary  real  numbers  and  / (x)  =  cx,  then 

lim  f(x)  =  cxq. 

To  prove  this,  we  write 

I/O)  -  cx0\  =  \cx  -  cx0\  =  \c\\x  -  x0\. 

If  c  7^  0,  this  yields 

|/(x)-cx0|<e  (2.1.6) 

if 

\x  —  xq\  <  8, 

where  S  is  any  number  such  that  0<<5<e/|c|.Ifc  =  0,  then  /  (x)  —  cxq  =  0  for  all  x, 
so  (2.1.6)  holds  for  all  x.  ■ 

We  emphasize  that  Definition  2.1.2  does  not  involve  f(xo),  or  even  require  that  it  be 
defined,  since  (2.1.5)  excludes  the  case  where  x  =  xq. 

Example  2.1.6  If 

/ (x)  =  x  sin  — ,  x^O, 
x 

then 

lim  f(x)  =  0 

x^-0 

even  though  /  is  not  defined  at  xq  =  0,  because  if 

0  <  |x|  <  8  =  e, 

then 


l/0)-o|  = 

On  the  other  hand,  the  function 


1 

x  sin  — 

x 


<    X    <  €. 


g(x)  =  sin  — ,    x  ^  0, 

x 

has  no  limit  as  x  approaches  0,  since  it  assumes  all  values  between  —1  and  1  in  every 
neighborhood  of  the  origin  (Exercise  2. 1 .26).  ■ 
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The  next  theorem  says  that  a  function  cannot  have  more  than  one  limit  at  a  point. 
Theorem  2.1.3  If  \imx^XQ  f(x)  exists,  then  it  is  unique;  that  is,  if 


lim  fix)  =  L\    and      lim  f(x)  =  Lj.  (2.1.7) 

X— *XQ  X~^>XQ 


then  L\  =  Lj- 


Proof  Suppose  that  (2.1.7)  holds  and  let  e  >  0.  From  Definition  2.1.2,  there  are  positive 
numbers  8\  and  82  such  that 

\f(x)-Li\<e    if    0  <\x-x0\  <St,     j  =  1,2. 

If  8  =  min(<5i,  82),  then 

\L1-L2\  =  \L1-f(x)  +  f(x)-L2\ 

<\L1-f(x)\  +  \f(x)-L2\<2e    if  0<\x-x0\<8. 

We  have  now  established  an  inequality  that  does  not  depend  on  x;  that  is, 

IL1-L2I  <2e. 

Since  this  holds  for  any  positive  e,  L\  =  L2.  H 
Definition  2. 1 .2  is  not  changed  by  replacing  (2. 1 .4)  with 

\f(x)-L\<Ke,  (2.1.8) 

where  K  is  a  positive  constant,  because  if  either  of  (2.1.4)  or  (2.1.8)  can  be  made  to  hold 
for  any  e  >  0  by  making  \x  —  Xo\  sufficiently  small  and  positive,  then  so  can  the  other 
(Exercise  2.1.5).  This  may  seem  to  be  a  minor  point,  but  it  is  often  convenient  to  work  with 
(2.1.8)  rather  than  (2. 1 .4),  as  we  will  see  in  the  proof  of  the  following  theorem. 

A  Useful  Theorem  about  Limits 
Theorem  2.1.4  // 


lim  f(x)  =  L\    and      lim  g(x)  =  L2,  (2.1.9) 

X— >XQ  X— >XQ 


then 


lim  (/  +  g)(x)  =  U  +  L2,  (2.1.10) 

x->x0 

lim  (/  -  g)(jc)  =  Li  -  L2,  (2.1.11) 

X— >XQ 

lim  (fg)(x)  =  LXL2,  (2.1.12) 

X~ >X§ 

and,  ifL2  ^  0,  (2.1.13) 


lim     —    (x)  =  — .  (2.1.14) 

x~*xa 
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Proof   From  (2.1.9)  and  Definition  2.1.2,  if  e  >  0,  there  is  a  8\  >  0  such  that 

|/(*)-Z.i|<e  (2.1.15) 

if  0  <  \x  —  xq\  <  Si,  and  a  82  >  0  such  that 

\g(x)-L2\<€  (2.1.16) 

if  0  <  \x  —  xq\  <  82.  Suppose  that 

0  <  |jc-jco|  <  8  =  mm(8i.82),  (2.1.17) 

so  that  (2.1.1 5)  and  (2. 1.16)  both  hold.  Then 

|(/  ±  g)(x)  -  (Li  ±  L2)|  =  \(f(x)  -  Li)  ±  (g(x)  -  L2)\ 

<  \f(x)-Li\  +  \g(x)-L2\<2e, 

which  proves  (2.1.10)  and  (2.1.11). 

To  prove  (2.1.12),  we  assume  (2.1.17)  and  write 

\(fg)(x)-L1L2\  =  \fXx)g(x)-L1L2\ 

=  \f(x)(g(x)-L2)  +  L2(f(x)-L1)\ 

<  \f(x)\\g(x)-L2\  +  \L2\\f(x)-Li\ 

<  (|/(x)|  +  \L2\)e    (from  (2.1.15)  and  (2.1.16)) 
<(\f{x)-Ll\  +  \Ll\  +  \L2\)e 

<{e  +  \Li\  +  \L2\)e    from  (2.1.15) 
<{\  +  \Li\  +  \L2\)e 

if  e  <  1  and  x  satisfies  (2.1.17).  This  proves  (2.1.12). 

To  prove  (2.1.14),  we  first  observe  that  if  L2  ^  0,  there  is  a  #3  >  0  such  that 

\g(x)-L2\<^, 

so 

\g{x)\  >  J^i  (2.1.18) 

if 

0  <  \x  -Xq\  <  83. 

To  see  this,  let  L  =  L2  and  e  =  \L2\/2  in  (2.1.4).  Now  suppose  that 

0  <  \x  —  xq\  <  min(<5i ,  82,  83), 
so  that  (2.1.15),  (2.1.16),  and  (2.1.18)  all  hold.  Then 
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'Am 


f(x)  LX 

L2 

|L2/(*)-Llg(*)| 
\g(x)L2\ 

-L-\Lzfix)-Llgix) 


2 


-2| 

2 
2 

il7p( 


L2[fix)  -Lt]  +  L\ [L2  —  gix)]\  (from (2.1.18)) 
-\L2\\fix)-L1\  +  \L1\\L2-gix)\] 


\L2\  +  |Li|)e    (from  (2.1.15)  and  (2.1.16)). 


This  proves  (2.1.14).  TJ 

Successive  applications  of  the  various  parts  of  Theorem  2.1.4  permit  us  to  find  limits 
without  the  e-S  arguments  required  by  Definition  2.1.2. 

Example  2.1.7  Use  Theorem  2.1.4  to  find 

9  —  x2 

lim   and      lim  (9  -  x2)ix  +  1). 

x->2  X  +  1  x~*2 

Solution  If  c  is  a  constant,  then  limx^xo  c  =  c,  and,  from  Example  2. 1 .5,  limx^xo  x  = 
xq.  Therefore,  from  Theorem  2.1.4, 

lim  (9  -  x2)  =  lim  9  -  lim  x2 

x— ±2  x— >2  x— >2 

=  lim  9  -  ( lim  x)2 

x—>2  x—>2 


and 


Therefore, 


and 


=  9-2^ 


lim  (x  +  1)  =  lim  x  +  lim  1  =  2  +  1  =  3. 

x— *2  x—>-2  x—f-2 


9-x2  _l^9-^  5 
x-^2  x  +  1        lim(;c  +  1)  3 

x->2 


lim 


lim(9-x2)(x  +  1)  =  lim (9-  x2)  lim(x  +  1)  =  5-  3  =  15. 

x— >2  x— *2  x—*2 


One-Sided  Limits 


The  function 


/ (x)  =  2x  sin  six 
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satisfies  the  inequality 

l/WI<6 

if  0  <  x  <  S  =  e/2.  However,  this  does  not  mean  that  lirn^o  f(x)  =  0,  since  /  is 
not  defined  for  negative  x,  as  it  must  be  to  satisfy  the  conditions  of  Definition  2.1.2  with 
A'o  =  0  and  L  =  0.  The  function 

g(x)  =  x  +  — ,    x  ^  0, 
x 

can  be  rewritten  as 

,  »      \x  +  1,    x  >  0, 
*«  =  jx-l,  *<0: 

hence,  every  open  interval  containing       =  0  also  contains  points  x\  and  x-i  such  that 
—  g(x-ij\  is  as  close  to  2  as  we  please.  Therefore,  limx^,xo         does  not  exist 
(Exercise  2.1.26). 

Although  / (x)  and  g(x)  do  not  approach  limits  as  x  approaches  zero,  they  each  exhibit 
a  definite  sort  of  limiting  behavior  for  small  positive  values  of  x,  as  does  g(x)  for  small 
negative  values  of  x.  The  kind  of  behavior  we  have  in  mind  is  defined  precisely  as  follows. 


lim      /(*)  =  X  lim       /(*)  =  (j. 


Figure  2.1.2 


Definition  2.1.5 

(a)    We  say  that  / (x)  approaches  the  left- 
and  write 

lim 

x^>x0 

if  /  is  defined  on  some  open  interval 
such  that 

[/(*)- L[  <e 


hand  limit  L  as  x  approaches  Xq  from  the  left, 
_f(x)  =  L, 

(a,  Xq)  and,  for  each  e  >  0,  there  is  a  S  >  0 

if      Xq  —  S  <  X  <  Xq. 
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(b)    We  say  that  / (x)  approaches  the  right-hand  limit  L  as  x  approaches  Xq  from  the 
right,  and  write 

lim   f(x)  =  L, 

if  /  is  defined  on  some  open  interval  (xo,  b)  and,  for  each  e  >  0,  there  is  a  S  >  0 
such  that 

\f(x)  —  L\  <  e    if    xo  <  x  <  xo  +  S.  ■ 

Figure  2.1.2  shows  the  graph  of  a  function  that  has  distinct  left-  and  right-hand  limits  at 
a  point  Xo- 


Example  2.1.8  Let 


f(x)  =  —,  x^O. 
\x\ 


If  x  <  0,  then  f(x)  =  —x/x  =  —1,  so 


lim  f{x)  =  -1. 


If  x  >  0,  then  f(x)  =  x/x  =  I,  so 


lim  f(x)  =  1. 

x->0  + 


Example  2.1.9  Let 


If  x  <  0,  then 


so 


*  +  |*|(1 +  s)  .1 
=   sin  — ,    x  5&  0. 


=  —x  sin  — , 

x 


lim  g(x)  =  0. 

jc— »-0— 


since 


1 


<   X  <€ 


\g(x)  -  0|  = 

if  —e  <  x  <  0;  that  is,  Definition  2.1.5(a)  is  satisfied  with  S  =  e.  If  x  >  0,  then 

g(x)  =  (2  +  x)  sin  -, 


which  takes  on  every  value  between  —2  and  2  in  every  interval  (0,5).  Hence,  g(x)  does  not 
approach  a  right-hand  limit  at  x  approaches  0  from  the  right.  This  shows  that  a  function 
may  have  a  limit  from  one  side  at  a  point  but  fail  to  have  a  limit  from  the  other  side. 


40    Chapter  2  Differential  Calculus  of  Functions  of  One  Variable 


Example  2.1.10  We  leave  it  to  you  to  verify  that 

lim  f  —  +  jc  |  =  1, 

x^0  +  \  X  J 

lim  [  Jfi  +  x]  =  -1, 

x-tf>-  \  x  J 

lim  x  sin  -Jx  =  0, 

x->0  + 

and  limx_>o-  sin  ^/J  does  not  exist.  ■ 

Left-  and  right-hand  limits  are  also  called  one-sided  limits.  We  will  often  simplify  the 
notation  by  writing 

lim   f(x)  =  /(xo— )    and       lim   f(x)  =  f(xo+). 

X— *XQ  —  X— >XQ  + 

The  following  theorem  states  the  connection  between  limits  and  one-sided  limits.  We 
leave  the  proof  to  you  (Exercise  2.1.12). 

Theorem  2.1.6  Afunction  f  has  a  limit  at  xq  if and  only  if it  has  left-  and  right-hand 
limits  at  Xq,  and  they  are  equal.  More  specifically, 

lim  f{x)  =  L 

x^>xo 

if  and  only  if 

f(xo+)  =  f(x0-)  =  L. 

With  only  minor  modifications  of  their  proofs  (replacing  the  inequality  0  <  \x  —  xq  \  <  S 
by  xo  —  S  <  x  <  io  or  io  <  x  <  xq  +  S),  it  can  be  shown  that  the  assertions  of  Theo- 
rems 2.1.3  and  2.1.4  remain  valid  if  "limx_>.xo"  is  replaced  by  "limx^xo_"  or  "limx_>xo+" 
throughout  (Exercise  2.1.13). 

Limits  at  ±oo 

Limits  and  one-sided  limits  have  to  do  with  the  behavior  of  a  function  /  near  a  limit  point 
of  D  f .  It  is  equally  reasonable  to  study  /  for  large  positive  values  of  x  if  D  f  is  unbounded 
above  or  for  large  negative  values  of  x  if  D  f  is  unbounded  below. 

Definition  2.1.7  We  say  that  f(x)  approaches  the  limit  L  as  x  approaches  oo,  and 
write 

lim  f{x)  =  L, 

x— >oo 

if  /  is  defined  on  an  interval  (a,  oo)  and,  for  each  e  >  0,  there  is  a  number  p1  such  that 

\f(x)-L\<€       if    x>f3.  * 
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Figure  2.1.3  provides  an  illustration  of  the  situation  described  in  Definition  2.1.7. 


Figure  2.1.3 

We  leave  it  to  you  to  define  the  statement  "limx^_oo  f(x)  =  L"  (Exercise  2.1.14)  and 
to  show  that  Theorems  2.1.3  and  2.1.4  remain  valid  if  xq  is  replaced  throughout  by  oo  or 
— oo  (Exercise  2.1.16). 


Example  2.1.11  Let 


Then 


/(*)  =  1  - 


2\x\ 


1  +  x 
lim  f{x)  =  1, 


,     and     h(x)  =  sin*. 


since 


and 


|/(jc)-l[  =  -L<e    if  x>-^-, 

X 


lim  g(x)  =  2, 


\g(x)-2\  = 


2x 


1  +  x 


2  2  2 
  <  —  <  c    if    x  >  -. 

1  +  X        X  € 


However,  lirn^oo  h{x)  does  not  exist,  since  h  assumes  all  values  between  —1  and  1  in  any 
semi-infinite  interval  (t,  oo). 

We  leave  it  to  you  to  show  that  limx^_oo  f(x)  =  1,  limx^_oo  g(x)  =  —2,  and 
liirijc^-oo  h(x)  does  not  exist  (Exercise  2.1.17).  ■ 
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We  will  sometimes  denote  limx^oo  / (x)  and  limx^_oo  / (x)  by  / (oo)  and  / (—00), 
respectively. 


Infinite  Limits 

The  functions 


/(*)  =  -,     g(x)  =  \,     P(x)  =  sin-1 


-T 


and 

#(*)  =  sin- 

do  not  have  limits,  or  even  one-sided  limits,  at  xq  =  0.  They  fail  to  have  limits  in  different 

ways: 

•  / (x)  increases  beyond  bound  as  x  approaches  0  from  the  right  and  decreases  beyond 
bound  as  x  approaches  0  from  the  left; 

•  g(x)  increases  beyond  bound  as  x  approaches  zero; 

•  p(x)  oscillates  with  ever-increasing  frequency  as  x  approaches  zero; 

•  q(x)  oscillates  with  ever-increasing  amplitude  and  frequency  as  x  approaches  0. 

The  kind  of  behavior  exhibited  by  /  and  g  near  xq  =  0  is  sufficiently  common  and 
simple  to  lead  us  to  define  infinite  limits. 

Definition  2.1.8  We  say  that  f(x)  approaches  00  as  x  approaches  xq  from  the  left, 
and  write 

lim   f(x)  =  00    or     f(xo— )  =  00, 

X— >XQ  — 

if  /  is  defined  on  an  interval  (a,  Xq)  and,  for  each  real  number  M ,  there  is  a  S  >  0  such 
that 

f(x)>M    if    xo  —  S  <  x  <  xo- 

Example  2.1.12  We  leave  it  to  you  to  define  the  other  kinds  of  infinite  limits  (Exer- 
cises 2.1.19  and  2.1.21)  and  show  that 

1  1 
lim  —  =  —00,      lim  —  =  00; 

x-H)-  x  x->0+  x 

1  1  1 

lim  —  =  lim  —  =  lim  —  =  00; 

x^O-  X2       x-»0+  X2       x^O  X2 

lim  x2  =    lim  x2  =  00; 


and 


lim  x3  =  00.       lim  x3  =  —00. 

x—>oo  x^—00 
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Throughout  this  book,  "lirric-^o  / (x)  exists"  will  mean  that 

lim  f(x)  =  L,    where  L  is  finite. 

x—t-xo 

To  leave  open  the  possibility  that  L  =  ±oo,  we  will  say  that 

lim  / (x)    exists  in  the  extended  reals. 

X— >XQ 

This  convention  also  applies  to  one-sided  limits  and  limits  as  x  approaches  ±oo. 

We  mentioned  earlier  that  Theorems  2.1.3  and  2.1.4  remain  valid  if  "limx^xo"  is  re- 
placed by  "limx^xo_"  or  "limx^xo  +  ."  They  are  also  valid  with  xq  replaced  by  ±oo. 
Moreover,  the  counterparts  of  (2.1.10),  (2.1.11),  and  (2.1.12)  in  all  these  versions  of  The- 
orem 2.1.4  remain  valid  if  either  or  both  of  L\  and  are  infinite,  provided  that  their 
right  sides  are  not  indeterminate  (Exercises  2.1.28  and  2.1.29).  Equation  (2.1.14)  and  its 
counterparts  remain  valid  if  L\/ L2  is  not  indeterminate  and  L2  7^  0  (Exercise  2.1.30). 

Example  2.1.13  Results  like  Theorem  2.1.4  yield 

ex  —  e~x      1  /  _  \ 

lim  sinhx  =  lim    =  -  (  lim  ex  —  lim  e~x  I 

x— >oo  x— >oo         2  2  \x— >oo  x— >oo  / 

=  -(00  —  0)  =  00, 


ex  —  e  x      1  /  _  \ 

lim  sinhx  =    lim    =  -  I   lim  ex  —   lim  e~x  I 

X— >  —  00  x— >  —  00  2  2  \x— >~  00  X— >  —  00  / 

-(0  —  00)  =  —00, 
2 


and 


e~x      x^ooe  X  0 

lim   =    =  _  =  0. 

x^oo   x  lim  x  00 

x— >oo 

Example  2.1.14  If 

f{x)  =  e2x-ex, 
we  cannot  obtain  limx^oo  / (x)  by  writing 

lim  f(x)  =  lim  e2x  —  lim  ex, 

X^rOO  X->00  X— >0O 

because  this  produces  the  indeterminate  form  00  —  00.  However,  by  writing 

f(x)  =  e2x(l-e-x), 

we  find  that 

lim  f(x)  =  (  lim  e2x)  (lim  1  -  lim  e~x)  =  oo(l  -  0)  =  00. 

x—>oo  \x—>oo         /  \x— >OQ  X^-OO  / 
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Example  2.1.15  Let 


2x2  -  x  +  1 


3x2  +  2x  -  1 ' 

Trying  to  find  lirn  ^oo  g(x)  by  applying  a  version  of  Theorem  2.1.4  to  this  fraction  as  it  is 
written  leads  to  an  indeterminate  form  (try  it!).  However,  by  rewriting  it  as 

2-\/x+\/x2 
S(x)  =  — —  — -,     x  ^  0, 


we  find  that 


lim  g(x) 


3  +  2/x-l/x2' 
lim  2—  lim  l/x+  lim  l/x2      o  _  n -u  0  9 

x^>-oo  x— >oo  X  — >oo    U  -(-  U    Z 

lim  3+  lim  2/x-  lim  l/x2  ~  3  +  0-0  ~  3' 

x— >oo  x— »-oo  x— >oo 


Monotonic  Function 

A  function  /  is  nondecreasing  on  an  interval  /  if 

/ (xi)  —  f  (xi)  whenever  x\  and  x-i  are  in  /  and  x\  <  x%, 
or  nonincreasing  on  /  if 

/ (xi)  —  f  (xi)    whenever  x\  and  Xi  are  in  /  and  x\  <  x%- 


(2.1.19) 


(2.1.20) 


In  either  case,  /  is  on  /.  If  <  can  be  replaced  by  <  in  (2.1.19),  /  is  increasing  on  I .  If  > 
can  be  replaced  by  >  in  (2.1.20),  /  is  decreasing  on  I .  In  either  of  these  two  cases,  /  is 
strictly  monotonic  on  / . 


Example  2.1.16  The  function 


/(*)  = 


[x,  0  <  x  <  1, 
h,  Kx<2, 


is  nondecreasing  on  /  =  [0,  2]  (Figure  2.1.4),  and  — /  is  nonincreasing  on  /  =  [0,  2]. 

y 


2 


x 
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Figure  2.1.6 
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In  the  proof  of  the  following  theorem,  we  assume  that  you  have  formulated  the  definitions 
called  for  in  Exercise  2.1.19. 

Theorem  2.1.9  Suppose  that  f  is  monotonic  on  (a,b)  and  define 
a  =    inf  f(x)    and    f)  =    sup  f(x). 

a<x<b  a<x<b 

(a)  If  f  is  nondecreasing,  then  f(a+)  =  a  and  f  (b—)  =  ft. 

(b)  Iff  is  nonincreasing,  then  f(a+)  =  ji  and  fib— )  =  a. 
(Here  a+  =  — oo  if  a  =  — oo  and  b—  =  oo  ifb  =  oo.) 

(c)  If  a  <  Xq  <  b,  then  f  (xo+)  and  f  (xo— )  exist  and  are  finite  ;  moreover, 

/Oo-)  <  f(x0)  <  f(x0  +  ) 

iff  is  nondecreasing,  and 

f(xo-)  >  f(x0)  >  f(x0+) 

iff  is  nonincreasing. 

Proof    (a)  We  first  show  that  f(a+)  =  a.  If 

M  >  a,  there  is  an  xq  in  (a,b)  such  that  f(xo)  <  M.  Since  /  is  nondecreasing, 
fix)  <  M  if  a  <  x  <  xq.  Therefore,  if  a  =  —  oo,  then  f(a+)  =  — oo.  If  a  >  — oo,  let 
M  =  a  +  e,  where  e  >  0.  Then  a  <  f{x)  <  a  +  e,  so 

\f(x)—a\<e    if    a  <  x  <  x0.  (2.1.21) 

If  a  =  — oo,  this  implies  that  /(— oo)  =  a.  If  a  >  — oo,  let  S  =  xq  —  a.  Then  (2.1.21)  is 
equivalent  to 

|  / (x)  —  a  |  <  e    if    a  <  x  <  a  +  S, 

which  implies  that  f(a+)  =  a. 

We  now  show  that  / (b—)  =  /3.  If  M  <  f>,  there  is  an  xq  in  (a,  b)  such  that  / (xo)  >  M. 
Since  /  is  nondecreasing,  f(x)  >  M  if  xq  <  x  <  b.  Therefore,  if  fi  =  oo,  then 
f(b-)  =  oo.  If  P  <  oo,  let  M  =  P  -  e,  where  e  >  0.  Then  f>  -  e  <  f(x)  <  fi,  so 

\f(x)-fi\<€    if    x0<x<b.  (2.1.22) 

If  b  =  oo,  this  implies  that  /(oo)  =  /},  If  b  <  oo,  let  S  =  b  —  xq.  Then  (2.1.22)  is 
equivalent  to 

\f(x)-P\<e    if  b-8<x<b, 
which  implies  that  f(b-)  =  ft. 

(b)  The  proof  is  similar  to  the  proof  of  (a)  (Exercise  2.1.34). 

(c)  Suppose  that  /  is  nondecreasing.  Applying  (a)  to  /  on  (a,xo)  and  (xo,b)  sepa- 
rately shows  that 

/(x0-)=    sup    f{x)    and     f(x0+)  =    inf  f(x). 

a<x<xo  xq<x<d 
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However,  if  x\  <  xq  <  X2,  then 

<  /(*o)  <  /(jc2); 

hence, 

f(xo~)  <  /(*o)  <  /(*o+). 
We  leave  the  case  where  /  is  nonincreasing  to  you  (Exercise  2.1.34).  H 

Limits  Inferior  and  Superior 

We  now  introduce  some  concepts  related  to  limits.  We  leave  the  study  of  these  concepts 
mainly  to  the  exercises. 

We  say  that  /  is  bounded  on  a  set  S  if  there  is  a  constant  M  <  oo  such  that  |  / (x)|  <  M 
for  all  x  in  S . 

Definition  2.1.10  Suppose  that  /  is  bounded  on  [a,  xo),  where  xq  may  be  finite  or  oo. 
For  a  <  x  <  xo,  define 

Sf(x;x0)=    sup  fit) 

X<t<XQ 

and 

//(x;x0)  =    inf  /(*). 

x<t<xo 

Then  the  left  limit  superior  of  f  at  xq  is  defined  to  be 

lim  f{x)  =    lim  S/(x;xo), 

X— >XQ—  X^XQ  — 

and  the  left  limit  inferior  of  f  at  xq  is  defined  to  be 

lim  fix)  =    lim  Ifix;xa). 

x^xq-  X-KKO- 

(If  xq  =  oo,  we  define  xq—  =  oo.) 

Theorem  2.1.11  If  f  is  bounded  on  [a,xo),  then  /3  =  limx^xo_/(x)  exists  and  is 
the  unique  real  number  with  the  following  properties  : 

(a)  If  e  >  0,  there  is  an  a\  in  [a,  xq)  such  that 

fix)<P  +  e    if    ai<x<x0.  (2.1.23) 

(b)  If  e  >  0  and  ai  is  in  [a,  Xq),  then 

f  (x)  >  f3  —  e   for  some  x  e  [a\,  xq). 

Proof  Since  /  is  bounded  on  [a,Xo),  S/ix;xo)  is  nonincreasing  and  bounded  on 
[a,  Xq).  By  applying  Theorem  2.1.9(b)  to  S fix;  xq),  we  conclude  that  f)  exists  (finite). 
Therefore,  if  e  >  0,  there  is  an  a  in  [a,  xo)  such  that 

P  -c/2  <  S/(x;x0)  <  P  +  e/2    if    a  <  x  <  x0.  (2.1.24) 
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Since  S/(x;  xo)  is  an  upper  bound  of  {/(f)  |  x  <  t  <  xq),  f(x)  <  S  /(x;  Xq).  Therefore, 
the  second  inequality  in  (2.1.24)  implies  (2.1.23)  with  ci\  =  a.  This  proves  (a).  To  prove 
(b),  let  ct\  be  given  and  define  x\  =  max(ai,a).  Then  the  first  inequality  in  (2.1.24) 
implies  that 

Sf(xi;x0)>  P-e/2.  (2.1.25) 

Since  S f{x\\  xq)  is  the  supremum  of  {/(f)  I  Xi  <  t  <  Xo},  there  is  an  x  in  [x\,  xq)  such 
that 

f{x)  >  Sf(xi;x0)  -  e/2. 
This  and  (2.1.25)  imply  that /(X)  >  fi  —  e.  SinceXis  in  [a\,xo),  this  proves  (b). 

Now  we  show  that  there  cannot  be  more  than  one  real  number  with  properties  (a)  and 
(b).  Suppose  that  Pi  <  P2  and  P2  has  property  (b);  thus,  if  e  >  0  and  a\  is  in  [a,  Xo), 
there  is  an  X  in  [a\ ,  xq)  such  that  / (x)  >  P2  —  e.  Letting  e  =  P2  —  Pi ,  we  see  that  there 
is  an  x  in  [a \ ,  b)  such  that 

f(x)>  P2-(P2-Pl)=  Pu 

so  Pi  cannot  have  property  (a).  Therefore,  there  cannot  be  more  than  one  real  number 
that  satisfies  both  (a)  and  (b).  TJ 

The  proof  of  the  following  theorem  is  similar  to  this  (Exercise  2.1.35). 

Theorem  2.1.12  If  f  is  bounded  on  [a,  xq),  then  a  =  limT_>T0_  f{x)  exists  and  is 
the  unique  real  number  with  the  following  properties: 

(a)  Ife  >  0,  there  is  an  a\  in  [a,  xq)  such  that 

f(x)>a  —  e    if    a\  <  x  <  xo- 

(b)  Ife  >  0  and  ai  is  in  [a,  xq),  then 

f(x)  <  a  +  e   for  some  x  e  [ai,Xo). 

2.1  Exercises 


1.  Each  of  the  following  conditions  fails  to  define  a  function  on  any  domain.  State 
why. 

(a)  sin/(x)  =  x  (b)  e/w  =  -|x| 

(c)  1  +  x2  +  [f(x)]2  =  0  (d)  f(x)[f(x)  -l]  =  x2 

2.  If   

(x  -  3)(x  +  2)  x2-\6  nr— 

  and    g(x)  =   Vr-  9, 

x  —  1  x  —  7 

find  Df,  Df±g,  Dfg,  and  Df/g. 
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3.    Find  Df. 

(a)/(x)  =  tanx  (b)/(x) 


y/ 1  —  |  sin  x  | 


(e)  e[/Wl2  =  x,    f{x)  >  0 

4.  Find  limx^xo  / (x),  and  justify  your  answers  with  an  e-S  proof. 

^3    g 

(a)  x2  +  2x  +  1,    x0  =  1  (b)   — ,    x0  =  2 

(c)  ,    *0  =  0  (d)  v^,    *o  =  4 

x  —  l 

(6)  (,-1)^-2)  +"'         =  1 

5.  Prove  that  Definition  2.1.2  is  unchanged  if  Eqn.  (2.1.4)  is  replaced  by 

|/(*)-L|  <  tfe, 

where  AT  is  any  positive  constant.  (That  is,  limx^xo  / (x)  =  L  according  to  Defini- 
tion 2. 1 .2  if  and  only  if  limx^xo  fix)  =  L  according  to  the  modified  definition.) 

6.  Use  Theorem  2.1.4  and  the  known  limits  lim*-^,,  x  =  xq,  \imx^.xo  c  =  c  to  find 
the  indicated  limits. 

,  s        x2  +  2x  +  3                          „  s        (    \           1  \ 
(a)  lim  — — —  (b)  lim  

^    '^2      2l3+l  V    '  x^2\x  +  1  X-lJ 

(\                                 1                                                   /  ■■  \  1 
c)  lim   (d)  lim  —  

'  x^l  x3  +  x2-2x  v  x^lx4-l 

7.  Find  limx_>xo_  / (x)  and  limx^xo+  / (x),  if  they  exist.  Use  e-S  proofs,  where  ap- 
plicable, to  justify  your  answers. 

,  x  x  +  |x|  s         1  1  1 

(a)  ,    xq  =  0         lb)*  cos — h  sin — I- sin- — -,    xo  =  0 

X  x  x  \x\ 

.  .      \x  —  II  ,  ,s  x2  +  x  —  2 

8.  Prove:  If  h(x)  >  0  for  a  <  x  <  Xo  and  limx^xo_  /z(x)  exists,  then  limx^xo_  /z(x) 
>  0.  Conclude  from  this  that  if  fzix)  >  for  a  <  jc  <  xo,  then 

lim   fzix)  >    lim  /i(x) 

X— >XQ—  X— >XQ  — 


if  both  limits  exist. 
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9.  (a)  Prove:  If  limx^xo  f[x)  exists,  there  is  a  constant  M  and  a  p  >  0  such  that 
I /Ml  <  M  if  0  <  |x  —  xo\  <  P-  (We  say  then  that  /  is  bounded  on 
{x  |  0  <  \x  -  x0|  <  p).) 

(b)  State  similar  results  with  "limx^xo"  replaced  by  "limx^xo_." 

(c)  State  similar  results  with  "limx^X()"  replaced  by  "limx^xo+." 

10.  Suppose  that  limx^xo  / (x)  =  L  and  n  is  a  positive  integer.  Prove  that  limx^xo  [/ (x)]n 
Ln  (a)  by  using  Theorem  2.1.4  and  induction;  (b)  directly  from  Definition  2.1.2. 
Hint:  You  will  find  Exercise  2.1.9  useful  for  (b). 

11.  Prove:  If  limx^xo  f(x)  =  L  >  0,  then  limx^xo  yj  f{x)  =  -J~L. 

1 2 .  Prove  Theorem  2 . 1 . 6 . 

13.  (a)   Using  the  hint  stated  after  Theorem  2.1.6,  prove  that  Theorem  2.1.3  remains 

valid  with  "limx^xo"  replaced  by  "limx^xo_." 
(b)   Repeat  (a)  for  Theorem  2.1.4. 

14.  Define  the  statement  "limx^_oo  f(x)  =  L." 

15.  Find  linix-j.oo  f(x)  if  it  exists,  and  justify  your  answer  directly  from  Definition  2. 1.7. 

,  .      1  .  sinx  ,  x  sinx 

(a)  —   (b)  — —    (a  >  0)  (c)  — —    (a  <  0) 

W  x2  +  1  V  '  \x\a     V        '  V  '  \x\a     K    -  ' 

(d)  e_Ji:sinx  (e)  tanx  (f)e~x2e2x 

16.  Theorems  2.1.3  and  2.1.4  remain  valid  with  "limx^xo"  replaced  throughout  by 
"limx^oo"  ("limx^_oo").  How  would  their  proofs  have  to  be  changed? 

17.  Using  the  definition  you  gave  in  Exercise  2.1.14,  show  that 

2|x| 


(a)    lim   ( 1  -  —  )  =  1  (b)  lim 

x^-oo  \         X    I  x^—c 


x^-ca  1  +  x 


(c)    lim  sin  x  does  not  exist 


18.  Find  limx^_oo  / (x),  if  it  exists,  for  each  function  in  Exercise  2.1.15.  Justify  your 
answers  directly  from  the  definition  you  gave  in  Exercise  2.1.14. 

19.  Define 

(a)    lim  f(x)  =  — oo     (b)    lim   f(x)  =  oo       (c)    lim   fix)  =  — oo 

X— >XQ—  X— >XQ+  X— >XQ  + 


20.  Find 

x->0+  X3  X-*0—  X 


(a)  lim  (b)  lim  — 


(c)   lim  —  (d)   lim  — 

x^0+  x6  x~*Q-  x6 

(e)    lim  ,  7  ! — \7k  (f )  J™  ' 


x^x0+  (x  —  X0)2k  x->-x0-  (x  —  Xo)2k+1 

(k  =  positive  integer) 
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21.  Define 


(a)  lim  f(x)  =  oo  (b)  lim  f(x)  =  —oo 

x— yxQ  x—>xq 


22.  Find 


(a)  lim  — r-  (b)  lim 

x^-0  x3  x^O  x° 

(c)  lim  7  "  TtT  (d)  lim  ' 


x^x0  (x  —  Xo)2k  x^-x0  (x  —  Xo)2k+1 

(k  =  positive  integer) 

23.  Define 

(a)  lim  f(x)  =  oo  (b)    lim  /(x)  =  —  oo 

x— *oo  x— y—  oo 

24.  Find 

(a)  lim  x2fc  (b)    lim  x2k 


(c)  lim  x2fc+1  (d)    lim  x2k+1 

x— >oo  x— oo 

(A;=positive  integer) 

(e)  lim  ~/x  sin*  (f )  lim  ex 

x— >oo  x— »oo 

25.  Suppose  that  /  and  g  are  defined  on  (a,oo)  and  (c,oo)  respectively,  and  that 
g{x)  >  a  if  x  >  c.  Suppose  also  that  lim^-xx,  f(x)  =  L,  where  — oo  <  L  <  oo, 
and  limx^oo  g(x)  =  oo.  Show  that  Hindoo  f(g(x))  =  L. 

26.  (a)   Prove:  limx^xo  / (x)  does  not  exist  (finite)  if  for  some  eo  >  0,  every  deleted 

neighborhood  of  xq  contains  points  x\  and  x2  such  that 

|/0ci)-/(*2)|  >eo- 
(b)    Give  analogous  conditions  for  the  nonexistence  of 

lim   f(x),       lim   f(x),      lim  f(x),    and       lim  f(x). 

x— *xo+  X— >XQ—  x— *oo  X— >— oo 

27.  Prove:  If  — oo  <  xq  <  oo,  then  limx^xo  f(x)  exists  in  the  extended  reals  if  and 
only  if  limx^xo_  f(x)  and  limx^.xo+  / (x)  both  exist  in  the  extended  reals  and  are 
equal,  in  which  case  all  three  are  equal. 

In  Exercises  2.1.28-2.1.30  consider  only  the  case  where  at  least  one  of  L\  and  Li  is  ±oo. 

28.  Prove:  If  limx^xo  / (x)  =  L\,  limx^xo  g{x)  =  L2,  and  L\  +  L2  is  not  indetermi- 
nate, then 

lim  (/  +  g)(x)  =  Ll  +  L2. 
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29.  Prove:  If  lirrix-j.oo  fix)  =  L\,  lirn^oo  g(x)  =  L2,  and  L\L2  is  not  indeterminate, 
then 

lim  (fg)(x)  =  LYL2. 

x— >oo 

30.  (a)   Prove:  If  limx^xo  f(x)  =  Lu  limx^XQ  g(x)  =  L2  ^  0,  and  Li/L2  is  not 

indeterminate,  then 


lim 

X  — >XQ 


(b)    Show  that  it  is  necessary  to  assume  that  L2  ^  0  in(a)  by  considering  / (x)  = 
sinx,  g(x)  =  cos x,  and  xq  =  it/ 2. 


31.  Find 


,  .  x3  +  2x  +  3  /,  \  ,.       x3  +  2.x  +  3 

(a)     llm,    o„4    I  ■    o  (b)  hl? 


*^-0+  2jc4  +  3jc2  +  2  v  '  x^o-  2x4  +  3x2  +  2 

,  s  2jt4  +  3;t2  +  2  ,  2.x4  +  3x2  +  2 
(c)  lim  — =   (d)    lim  — =  

V   '  x^oo    x3  +  2x  +  3  V    '  x^-oo    x3  +  2x  +  3 

(e)  linWooCe*2  -  e*)  (f )  lim  *  +  ^sin* 

x^oo    2x  +  e~x 

32.  Find  limx^oo  r(x)  and  limx^_oo  r(x)  for  the  rational  function 

.  .      a0  +  a\x  H  V  anx" 

r(x)  — —   

bo  +  b\x  H  V  bmxm' 

where  a„  ^  0  and  &m  7^  0. 

33.  Suppose  that  limx^xo  f{x)  exists  for  every  xq  in  {a,  b)  and  g^x)  =  f(x)  except 
on  a  set  S  with  no  limit  points  in  (a,  b).  What  can  be  said  about  \m\x->XQ  g{x)  for 
xq  in  (a,  b)l  Justify  your  answer. 

34.  Prove  Theorem  2.1.9(b),  and  complete  the  proof  of  Theorem  2.1.9(b)  in  the  case 
where  /  is  nonincreasing. 

35.  Prove  Theorem  2.1.12. 

36.  Show  that  if  /  is  bounded  on  [a,  Xo),  then 

(a)  lim  /(*)<   Em  /(*). 

x— >xo 

(b)  lhn  (-/)(x)  =  -  lim  f(x)  and   Urn  (-/)(x)  =  -  lim  fix). 

x—>xo—  x—*-xq—  x~*xo—  x— >xo_ 

(c)  lim   /(x)  =    lim   f{x)  if  and  only  if  \ivax^.xo-  fix)  exists,  in  which 

x^x0-  x^x0- 

case 

lim   f{x)  =    lim   fix)  =    lim  fix). 

x^x0-  x^x0-  x^x0- 

37.  Suppose  that  /  and  g  are  bounded  on  [a,  xq). 


Section  2.2  Continuity  53 


(a)   Show  that 


lim  (/  +  g)(x)  <    lim   f(x)  +    lim  g(x). 

X  — >XQ  —  X— >xq—  x~»xo  — 


(b)    Show  that 


lim  (/  +  g)(x)  >    lim   fix)  +    lim  g(x). 

X— +XQ—  X— >XQ—  x—>xo  — 


(c)    State  inequalities  analogous  to  those  in  (a)  and  (b)  for 


lim  (/  -  g)(x)    and       lim_(/  -  g)(x). 

X—*X(j—  X~*XQ 


38.  Prove:  limjt;^xo_  fix)  exists  (finite)  if  and  only  if  for  each  e  >  0  there  is  a  S  >  0 
such  that  \f(xi)  —  f(x2)\  <  €  if  xq  —  S  <  x\,  x-i  <  xq.  Hint:  For  sufficiency, 
show  that  f  is  bounded  on  some  interval  (a,  Xq)  and 


lim  f(x)  =    lim  f(x). 

Then  use  Exercise  2.1.36(c). 

39.  Suppose  that  /  is  bounded  on  an  interval  (xq,  b].  Using  Definition  2. 1.10  as  a  guide, 
define  limx^xo+  fix)  (the  right  limit  superior  of  f  at  xq)  and  limT^,T0+  f{x)  (the 
right  limit  inferior  of  f  atxo).  Then  prove  that  they  exist.  Hint:  Use  Theorem  2.1.9. 

40.  Suppose  that  /  is  bounded  on  an  interval  (xo,b].  Show  that  limT^T0+  fix)  = 
limx^xo+  / (x)  if  and  only  if  \imx^XQ  +  / (x)  exists,  in  which  case 

lim   fix)  =    lim   fix)  =    lim  fix). 

x-*-xo+  X-TXQ+  x^x0+ 


41.     Suppose  that  /  is  bounded  on  an  open  interval  containing  xq.  Show  that  limx^xo  / (x) 
exists  if  and  only  if 


lim   fix)  =    lim   fix)  =    lim   fix)  =    lim  fix), 

X^XQ-  X^-XQ  +  X^XQ-  X^XQ  + 


in  which  case  limx^xo  fix)  is  the  common  value  of  these  four  expressions. 


2.2  CONTINUITY 

In  this  section  we  study  continuous  functions  of  a  real  variable.  We  will  prove  some  impor- 
tant theorems  about  continuous  functions  that,  although  intuitively  plausible,  are  beyond 
the  scope  of  the  elementary  calculus  course.  They  are  accessible  now  because  of  our  better 
understanding  of  the  real  number  system,  especially  of  those  properties  that  stem  from  the 
completeness  axiom. 
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The  definitions  of 

f(xo-)  =    lim   f(x),     f(xo+)  =    lim   f{x),     and      lim  f(x) 

X^XQ-  X-/-XQ  +  x->xo 

do  not  involve  / (xq)  or  even  require  that  it  be  defined.  However,  the  case  where  / (xo)  is 
defined  and  equal  to  one  or  more  of  these  quantities  is  important. 

Definition  2.2.1 

(a)  We  say  that  /  is  continuous  at  xo  if  /  is  defined  on  an  open  interval  (a,  b)  containing 
xQ  and  lim*-^  f(x)  =  f(x0). 

(b)  We  say  that  /  is  continuous  from  the  left  at  xq  if  /  is  defined  on  an  open  interval 
(a,  x0)  and  f(x0-)  =  f(x0). 

(c)  We  say  that  /  is  continuous  from  the  right  at  xq  if  /  is  defined  on  an  open  interval 
(x0,  b)  and  f(x0+)  =  f(x0).  ■ 

The  following  theorem  provides  a  method  for  determining  whether  these  definitions  are 
satisfied.  The  proof,  which  we  leave  to  you  (Exercise  2.2.1),  rests  on  Definitions  2.1.2, 
2.1.5,  and  2.2.1. 

Theorem  2.2.2 

(a)   A  function  f  is  continuous  at  Xq  if  and  only  if  f  is  defined  on  an  open  interval  (a,  b) 
containing  Xq  and  for  each  e  >  0  there  is  a  8  >  0  such  that 


whenever  \x  —  Xq\  <  S. 

(b)  A  function  f  is  continuous  from  the  right  at  Xq  if  and  only  if  f  is  defined  on  an 
interval  [xo,  b)  and  for  each  €  >  0  there  is  a  8  >  0  such  that  (2.2.1)  holds  whenever 

Xq  <  X  <  Xq  +  S, 

(c)  A  function  f  is  continuous  from  the  left  at  Xq  if and  only  if  f  is  defined  on  an  interval 
(a,  Xq]  and  for  each  e  >  0 

there  is  a  8  >  0  such  that  (2.2.1)  holds  whenever  Xq  —  8  <  x  <  Xq. 

From  Definition  2.2.1  and  Theorem  2.2.2,  /  is 
continuous  at  xq  if  and  only  if 


or,  equivalently,  if  and  only  if  it  is  continuous  from  the  right  and  left  at  xo  (Exercise  2.2.2). 
Example  2.2.1  Let  /  be  defined  on  [0, 2]  by 


|/(*)-/(*o)|  <e 


(2.2.1) 


f(xo-)  =  f(x0+)  =  f(x0) 


/(*)  = 


x2,  0  <  x  <  1, 
x  +  1,    1  <  x  <  2 
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(Figure  2.2.1);  then 

/(o+)  =  o  =  /(0), 

/(1-)  =  1  ?  /(I)  =  2, 
/(l+)  =  2  =  /(l), 
/(2-)  =  3  =  f(2). 

Therefore,  /  is  continuous  from  the  right  at  0  and  1  and  continuous  from  the  left  at  2,  but 
not  at  1.  If  0  <  x,  xq  <  1,  then 

\f(x)-  f(x0)\  =  \x2-x%\  =  \x-x0\  \x+x0\ 

<  2\x  —  Xo\  <  €    if     \x  —  xo\<e/2. 

Hence,  /  is  continuous  at  each  xq  in  (0,  1).  If  1  <  x,  xq  <  2,  then 

\f(x)  -  f(x0)\  =  \(x  +  1)  -  (x0  +  1)  =  \x  -  x0\ 
<  e    if     \x  —  xq\  <  e. 

Hence,  /  is  continous  at  each  xq  in  (1,2). 


'y  =  x+l,  l<x<2 


y  =  x2,  0<x<\ 


Figure  2.2.1 

Definition  2.2.3  A  function  /  is  continuous  on  an  open  interval  {a,  b)  if  it  is  continu- 
ous at  every  point  in  (a,b).  If,  in  addition, 


or 


/(*-)  =  f(b) 
f(a+)  =  f(a) 


(2.2.2) 
(2.2.3) 
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then  /  is  continuous  on  (a,b]  or  [a,b),  respectively.  If  /  is  continuous  on  (a,  b)  and 
(2.2.2)  and  (2.2.3)  both  hold,  then  /  is  continuous  on  [a,b].  More  generally,  if  S  is  a  subset 
of  D  f  consisting  of  finitely  or  infinitely  many  disjoint  intervals,  then  /  is  continuous  on  S 
if  /  is  continuous  on  every  interval  in  S.  (Henceforth,  in  connection  with  functions  of  one 
variable,  whenever  we  say  "/  is  continuous  on  5"'  we  mean  that  S  is  a  set  of  this  kind.) 

Example  2.2.2  Let  f(x)  =  -Jx,  0  <  x  <  oo.  Then 

\f(X)-  f(0)\  =  ^  <€     if  0<X<62, 

so  /(0+)  =  /(0).  If  x0  >  0  and  x  >  0,  then 

i/o)-  /(x0)i  =  ivi-  v^i  =  l*~XoL_ 

-Jx  +  V^o 

\x  -x0\ 


<e    if     \x  —  xq\  <  e^/xo, 


/x0 

so  limx^.X(J  f  (x)  =  f  (xo).  Hence,  /  is  continuous  on  [0,  oo). 
Example  2.2.3  The  function 

1 


*(*)  = 


sin  Tix 


is  continuous  on  5  =  U«t=-oo(w'n  +  However,  g  is  not  continuous  at  any  xo  =  n 
(integer),  since  it  is  not  defined  at  such  points.  ■ 

The  function  /  defined  in  Example  2.2.1  (see  also  Figure  2.2.1)  is  continuous  on  [0.  1) 
and  [1,2],  but  not  on  any  open  interval  containing  1.  The  discontinuity  of  /  there  is  of  the 
simplest  kind,  described  in  the  following  definition. 

Definition  2.2.4  A  function  /  is  piecewise  continuous  on  [a,  b]  if 

(a)  / (xo+)  exists  for  all  xq  in  [a,  b); 

(b)  / {xq— )  exists  for  all  xq  in  (a,  b]; 

(c)  / (xo+)  =  f  (xq—  )  =  / (xq)  for  all  but  finitely  many  points  xq  in  (a,  b). 

If  (c)  fails  to  hold  at  some  xq  in  (a,  b),  f  has  a  jump  discontinuity  at  xq.  Also,  /  has  a 
jump  discontinuity  at  a  if  f(a+)^  f(a)  or  at  b  if  f(b—)y^  f(b). 


Example  2.2.4  The  function 


fix)  =  ■ 


1,  x  =  0, 

x,  0  <  x  <  1, 

2,  x  =  1, 

x,  1  <  x  <  2, 

-1,  2  <  x  <  3, 

0,  x  =  3, 


(Figure  2.2.2)  is  the  graph  of  a  piecewise  continuous  function  on  [0,  3],  with  jump  discon- 
tinuities at  xq  =  0,  1,  2,  and  3. 
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1  "- 


-1  - 


Figure  2.2.2 

The  reason  for  the  adjective  "jump"  can  be  seen  in  Figures  2.2.1  and  2.2.2,  where  the 
graphs  exhibit  a  definite  jump  at  each  point  of  discontinuity.  The  next  example  shows  that 
not  all  discontinuities  are  of  this  kind. 


Example  2.2.5  The  function 


sin  — ,  i 

x 


0. 


x  =  0, 


is  continuous  at  all  xq  except  xq  =  0.  As  x  approaches  0  from  either  side,  / (x)  oscillates 
between  —1  and  1  with  ever-increasing  frequency,  so  neither  /(0+)  nor  /(0— )  exists. 
Therefore,  the  discontinuity  of  /  at  0  is  not  a  jump  discontinuity,  and  if  p  >  0,  then  /  is 
not  piecewise  continuous  on  any  interval  of  the  form  [— p,  0],  [— p,  p],  or  [0,  p].  ■ 

Theorems  2.1.4  and  2.2.2  imply  the  next  theorem  (Exercise  2.2.18). 

Theorem  2.2.5  If  f  and  g  are  continuous  on  a  set  S,  then  so  are  f  +  g,  f  —  g,  and 
fg.  In  addition,  f/g  is  continuous  at  each  Xq  in  S  such  that  g(xo)  ^  0. 

Example  2.2.6  Since  the  constant  functions  and  the  function  f(x)  =  x  are  continu- 
ous for  all  x,  successive  applications  of  the  various  parts  of  Theorem  2.2.5  imply  that  the 
function 

9-x2 
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is  continuous  for  all  x  except  x  =  —  1  (see  Example  2.1.7).  More  generally,  by  starting 
from  Theorem  2.2.5  and  using 

induction,  it  can  be  shown  that  if  f\,  fi,  f„  are  continuous  on  a  set  S,  then  so  are 
fi  +  fz  +  1-  fn  and  f\ff"  fn-  Therefore,  any  rational  function 

a0  +  a\x  H  V  a„xn 

b0  +  b\x  H  1-  bmxm 

is  continuous  for  all  values  of  x  except  those  for  which  its  denominator  vanishes. 


Removable  Discontinuities 

Let  /  be  defined  on  a  deleted  neighborhood  of  xq  and  discontinuous  (perhaps  even  unde- 
fined) at  xq.  We  say  that  /  has  a  at  xq  if  limx^XQ  f  (x)  exists.  In  this  case,  the  function 


g(x)  = 


\f(x)  if  x  €  D  f  and  x  ^  xq  , 


lim  / (x)     if  x  =  xo. 


is  continuous  at  xq. 

Example  2.2.7  The  function 


/ (x)  =  x  sin  — 

x 


is  not  defined  at  xq  =  0,  and  therefore  certainly  not  continuous  there,  but  limx^o  f(x)  =  0 
(Example  2.1.6).  Therefore,  /  has  a  removable  discontinuity  at  0. 

The  function 

fi(x)  =  sin- 
x 

is  undefined  at  0  and  its  discontinuity  there  is  not  removable,  since  limx_>o  fi  (x)  does  not 
exist  (Example  2.2.5). 


Composite  Functions 

We  have  seen  that  the  investigation  of  limits  and  continuity  can  be  simplified  by  regarding  a 
given  function  as  the  result  of  addition,  subtraction,  multiplication,  and  division  of  simpler 
functions.  Another  operation  useful  in  this  connection  is  composition  of  functions;  that  is, 
substitution  of  one  function  into  another. 

Definition  2.2.6  Suppose  that  /  and  g  are  functions  with  domains  D f  and  Dg.  If 
Dg  has  a  nonempty  subset  T  such  that  g(x)  e  D  f  whenever  x  €  T,  then  the  composite 
function  f  o  g  is  defined  on  T  by 


(/  o  g)(x)  =  f(g(x)). 
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Example  2.2.8  If 

f(x)  =  log*    and     g(x)  = 


l-x2' 
then 

D /  =  (0,  oo)    and  =  {x  \  x  ^  ±1}  . 

Since  g(x)  >  0  if  x  e  7*  =  (—1,1),  the  composite  function  /  o  g  is  defined  on  (—1, 1)  by 

(/  0  g)(x)  =  log  j-j— J- 

We  leave  it  to  you  to  verify  that  g  o  f  is  defined  on  (0, 1/e)  U  (1/e,  e)  U  (e,  oo)  by 

1 


(g  o  /)(*)  = 


l-(logx)2 

The  next  theorem  says  that  the  composition  of  continuous  functions  is  continuous. 

Theorem  2.2.7  Suppose  that  g  is  continuous  at  x$,  g(xo)  is  an  interior  point  of D  y . 
and  f  is  continuous  at  g(xo).  Then  f  o  g  is  continuous  at  Xq. 

Proof  Suppose  that  e  >  0.  Since  g(xo  )  is  an  interior  point  of  D  f  and  /  is  continuous 
at  g(xo),  there  is  a  Si  >  0  such  that  / (/)  is  defined  and 

1/(0 -/(*(*>))[<  €    if     [r-*(JC0)|  <«i.  (2.2.4) 
Since  g  is  continuous  at  xq,  there  is  a  S  >  0  such  that  g(x)  is  defined  and 

\g(x)-g(x0)\  <Si    if    |*-jc0|<«.  (2.2.5) 
Now  (2.2.4)  and  (2.2.5)  imply  that 

\f(g(x))-f(g(x0))\<e    if  |jc-*0|<*. 
Therefore,  /  o  g  is  continuous  at  xq.  TJ 
See  Exercise  2.2.22  for  a  related  result  concerning  limits. 

Example  2.2.9  In  Examples  2.2.2  and  2.2.6  we  saw  that  the  function 

f(x)  =  Vx 
is  continuous  for  x  >  0,  and  the  function 

9-x2 


g(x)  = 


x  +  1 

is  continuous  for  x  ^  —1.  Since  g(x)  >  0  if  x  <  —  3  or  —  1  <  x  <  3,  Theorem  2.2.7 
implies  that  the  function 


(/  o  g)(x)  = 


'9-x 


2 


X  +  1 

is  continuous  on  (— oo,  —3)  U  (— 1,  3).  It  is  also  continuous  from  the  left  at  —3  and  3. 
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Bounded  Functions 

A  function  /  is  bounded  below  on  a  set  S  if  there  is  a  real  number  m  such  that 

/ (x)  >  m  for  all  x  €  S. 

In  this  case,  the  set 

7  =  {f{x)  \xzS} 

has  an  infimum  a,  and  we  write 

a  =  inf  fix). 
xeS 

If  there  is  a  point  *i  in  S  such  that  /(xi)  =  a,  we  say  that  a  is  the  minimum  of  f  on  S, 
and  write 

a  =  min  /  (x). 
xeS 

Similarly,  /  is  bounded  above  on  S  if  there  is  a  real  number  M  such  that  / (x)  <  M  for 
all  x  in  S.  In  this  case,  F  has  a  supremum  /3,  and  we  write 

£  =  sup  f{x). 
xeS 

If  there  is  a  point  x^  in  S  such  that  / (X2)  =  ft,  we  say  that  /3  is  the  maximum  of  f  on  S, 
and  write 

P  =  max  /  (x). 

If  /  is  bounded  above  and  below  on  a  set  S,  we  say  that  /  is  bounded  on  S. 

Figure  2.2.3  illustrates  the  geometric  meaning  of  these  definitions  for  a  function  / 
bounded  on  an  interval  5  =  [a,b].  The  graph  of  /  lies  in  the  strip  bounded  by  the 
lines  y  =  M  and  y  =  m,  where  M  is  any  upper  bound  and  m  is  any  lower  bound 
for  /  on  [a,b].  The  narrowest  strip  containing  the  graph  is  the  one  bounded  above  by 
y  =     =  supa<x<b  f(x)  and  below  by  y  =  a  =  infa<x<b  fix). 


y  =  m 
 *~  x 


Figure  2.2.3 
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Example  2.2.10  The  function 

(  i 

g(x)  = 


x  =  0    or    x  =  1, 


+ 


1  —  x,    0  <  x  <  1, 
(Figure  2.2.4(a))  is  bounded  on  [0,  1],  and 

sup  g(x)  =  1,       inf  gfx)  =  0. 

0<x<l  0<x<l 

Therefore,  g  has  no  maximum  or  minimum  on  [0,  1],  since  it  does  not  assume  either  of  the 
values  0  and  1 . 

The  function 

h(x)  =  1  -x,    0  <  x  <  1, 

which  differs  from  g  only  at  0  and  1  (Figure  2.2.4(b)),  has  the  same  supremum  and  infi- 
mum  as  g,  but  it  attains  these  values  at  x  =  0  and  x  =  1,  respectively;  therefore, 

max  h(x)  =  1    and      min  h(x)  =  0. 

0<x<l  0<x<l 


y 


y  =  g(x) 


(a) 


v 

M 


Figure  2.2.4 


Example  2.2.11  The  function 


fix)  =  e 


1 


y  =  1  —  x 


(b) 


x(x  -  1) 


,    0  <  x  <  1, 


oscillates  between  ±ex(-x  ^  infinitely  often  in  every  interval  of  the  form  (0,  p)  or  (1  — p,  1), 
where  0  <  p  <  1 ,  and 

sup  f(x)  =  l,      inf  /(x)  =  -l. 

0<x<l  0<x<l 


However,  /  does  not  assume  these  values,  so  /  has  no  maximum  or  minimum  on  (0,  1). 
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Theorem  2.2.8  Iff  is  continuous  on  a  finite  closed  interval  [a,  b],  then  f  is  bounded 
on  [a,b]. 

Proof  Suppose  that  t  e  [a,b].  Since  /  is  continuous  at  t,  there  is  an  open  interval  It 
containing  t  such  that 

\f{x)  -  f{t) |  <  1    if     xe/,n[fl,i].  (2.2.6) 

(To  see  this,  set  e  =  1  in  (2.2.1),  Theorem  2.2.2.)  The  collection  3f  =  {lt  |  a  <  t  <  b}  is 
an  open  covering  of  [a,b].  Since  [a,  b]  is  compact,  the  Heine-Borel  theorem  implies  that 
there  are  finitely  many  points  t\,  t2,  ■  ■  . ,  tn  such  that  the  intervals  Itx ,  It2,  . . . ,  Itn  cover 
[a,b].  According  to  (2.2.6)  with  t  =  t,, 

\f(x)-f(ti)\  <  1    if     xel,t  n[a,b]. 

Therefore, 

|/(jc)|  =  \{f{x)  -  f{u))  +  f{ti)\  <  \f{x)  -  f{u)\  +  | /ft) | 

(2.2.7) 

<l  +  |/ft)|    if  xeltin[a,b]. 

Let 

M  =  1  +  max  \  f(ti)\. 

\<i<n 

Since  [a,  ft]  C  |J"=i       n  (2-2-7)  implies  that  |/(x)|  <  M  if  x  e  [a,  b].  TJ 

This  proof  illustrates  the  utility  of  the  Heine-Borel  theorem,  which  allows  us  to  choose 
M  as  the  largest  of  a  finite  set  of  numbers. 

Theorem  2.2.8  and  the  completeness  of  the  reals  imply  that 

if  /  is  continuous  on  a  finite  closed  interval  [a,  b],  then  /  has  an  infimum  and  a  supre- 
mum  on  [a,b\.  The  next  theorem  shows  that  /  actually  assumes  these  values  at  some 
points  in  [a,b]. 

Theorem  2.2.9  Suppose  that  f  is  continuous  on  a  finite  closed  interval  [a,  b].  Let 
a  =    inf  fix)    and    f>  =    sup  fix). 

a<x<b  a<x<b 

Then  a  and  /3  are  respectively  the  minimum  and  maximum  of  f  on  [a,  b];  that  is,  there  are 
points  X\  and  x-i  in  [a,  b]  such  that 

fix\)  =  a    and     fix?)  =  f>. 

Proof   We  show  that  x\  exists  and  leave  it  to  you  to  show  that  x-i  exists  (Exercise  2.2.24). 

Suppose  that  there  is  no  x\  in  [a,b]  such  that  fix\)  =  a.  Then  fix)  >  a  for  all 
x  G  [a,b].  We  will  show  that  this  leads  to  a  contradiction. 

Suppose  that  t  e  [a,  b].  Then  fit)  >  a,  so 

fit)  >  ™   a- 
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Since  /  is  continuous  at  t,  there  is  an  open  interval  It  about  t  such  that 

f{x)  >        +  a    if    x  eltn[a,b]  (2.2.8) 

(Exercise  2.2.15).  The  collection  Jf  =  {lt  |  a  <  t  <  b}  is  an  open  covering  of  [a,  b].  Since 
[a,  b]  is  compact,  the  Heine-Borel  theorem  implies  that  there  are  finitely  many  points  t\, 
t2,  ■  ■  ■ ,  tn  such  that  the  intervals  Itx ,  It2, . . . ,  Itn  cover  [a,b].  Define 

.  f{U)+a 
ct\  =  min   . 

\<i<n  1 

Then,  since  [a,  b]  C  U"=i(7«;  n  Ia>       (2-2.8)  implies  that 

f(t)  >  ai,    a  <t  <  fc. 

But  ai  >  a,  so  this  contradicts  the  definition  of  a.  Therefore,  f(x\)  =  a  for  some  x\  in 
[a,  6].  H 

Example  2.2.12  We  used  the  compactness  of  [a,b]  in  the  proof  of  Theorem  2.2.9 
when  we  invoked  the  Heine-Borel  theorem.  To  see  that  compactness  is  essential  to  the 
proof,  consider  the  function 

g(x)  =  1  -  (1  -  x)sin-, 

x 

which  is  continuous  and  has  supremum  2  on  the  noncompact  interval  (0,  1],  but  does  not 
assume  its  supremum  on  (0,  1],  since 


1 

sin  — 

x 


g{x)<\  +  {\-x) 

<  1  +  (1  -x)  <  2    if    0  <  x  <  1. 
As  another  example,  consider  the  function 

f{x)  =  e~\ 

which  is  continuous  and  has  infimum  0,  which  it  does  not  attain,  on  the  noncompact  interval 
(0,  oo).  ■ 


The  next  theorem  shows  that  if  /  is  continuous  on  a  finite  closed  interval  [a.b],  then  / 
assumes  every  value  between  / (a)  and  / (b)  as  x  varies  from  a  to  b  (Figure  2.2.5,  page  64). 

Theorem  2.2.10  (Intermediate  Value  Theorem)  Suppose  that  f  is  con- 
tinuous on  [a,  b],  fia)  7^  fib),  and  \i  is  between  fia)  and  fib).  Then  /(c)  =  pi  for 
some  c  in  (a,b). 
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y 


y=m 


a 


x 


b 


Figure  2.2.5 


Proof   Suppose  that  f(a)<pi<  f{b).  The  set 

S  =  {x\a<x<b    and  /  (x)  <  pi} 


is  bounded  and  nonempty.  Let  c  =  sup  5.  We  will  show  that  f(c)  =  pi.  If  f(c)  >  pi, 
then  c  >  a  and,  since  /  is  continuous  at  c,  there  is  an  e  >  0  such  that  f(x)  >  /x  if 
c  —  €  <  x  <  c  (Exercise  2.2.15).  Therefore,  c  —  e  is  an  upper  bound  for  S,  which 
contradicts  the  definition  of  c  as  the  supremum  of  S.  If  / (c)  <  pi,  then  c  <  b  and  there  is 
an  €  >  0  such  that  / (x)  <  pi  for  c  <  x  <  c  +  e,  so  c  is  not  an  upper  bound  for  5.  This  is 
also  a  contradiction.  Therefore,  f{c)  =  pi. 

The  proof  for  the  case  where  f(b)  <  pi  <  f  (a)  can  be  obtained  by  applying  this  result 


Uniform  Continuity 

Theorem  2.2.2  and  Definition  2.2.3  imply  that  a 

function  /  is  continuous  on  a  subset  S  of  its  domain  if  for  each  e  >  0  and  each  Xg  in  S, 
there  is  a  8  >  0,  which  may  depend  upon  xq  as  well  as  €,  such  that 

\f(x)  —  f(xo)\  <  e    if     \x  —  xo\  <  S    and     x  e  Df. 

The  next  definition  introduces  another  kind  of  continuity  on  a  set  S . 

Definition  2.2.11  A  function  /  is  uniformly  continuous  on  a  subset  5  of  its  domain 
if,  for  every  e  >  0,  there  is  a  S  >  0  such  that 


We  emphasize  that  in  this  definition  S  depends  only  on  e  and  S  and  not  on  the  particular 
choice  of  x  and  x' ,  provided  that  they  are  both  in  5. 

Example  2.2.13  The  function 


to-/. 


U 


\f(x)  —  f(x')\  <  e  whenever  \x  —  x'\  <  S  and  x,  x'  e  S. 


f(x)  =  2x 
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is  uniformly  continuous  on  (—00,  00),  since 

\f(x)-f(x')\=2\x-x'\<e    if  \x-x'\<e/2. 

Example  2.2.14  If  0  <  r  <  00,  then  the  function 

g(x)  =  x2 

is  uniformly  continuous  on  [—  r,  r\.  To  see  this,  note  that 

\g(x)  -  g(x')  =  \x2  -  (x')2\  =  \x  -  x'\  \x  +  x'\  <  2r\x  -  x'\ 


\g(x)  —  g(x')\  <  €    if     \x  —  x'\  <  8  =  —    and  —  r  <  x,  x'  <  r.  ■ 

2r 

Often  a  concept  is  clarified  by  considering  its  negation:  a  function  /  is  not  uniformly 
continuous  on  S  if  there  is  an  60  >  0  such  that  if  S  is  any  positive  number,  there  are  points 
x  and  x'  in  S  such  that 

|x-jc'|<<$    but    \f(x)- f(x')\>e0. 

Example  2.2.15  The  function  g{x)  =  x2  is  uniformly  continuous  on  [— r,  r]  for  any 
finite  r  (Example  2.2.14),  but  not  on  (—00,  00).  To  see  this,  we  will  show  that  if  S  >  0 
there  are  real  numbers  x  and  x'  such  that 

\x-x'\=8/2    and     \g(x)  -  g(x')\  >  I. 

To  this  end,  we  write 

\g(x)  -  g{x')\  =  \x2  -  (x')2\  =  \x-  x'\  \x  +  x'\. 
If  \x-x'\  =  S/2andx,x'  >  1/8,  then 

\x-x'\\x  +  x'\>S-(l  +  l)  =  l. 


Example  2.2.16  The  function 


f(x)  =  cos  - 


is  continuous  on  (0,  1]  (Exercise  2.2.23 (i)).  However,  /  is  not  uniformly  continuous  on 
(0, 1],  since 


(n  +  l)ir 


=  2,    n  =  1,2, 


Examples  2.2.15  and  2.2.16  show  that  a  function  may  be  continuous  but  not  uniformly 
continuous  on  an  interval.  The  next  theorem  shows  that  this  cannot  happen  if  the  interval 
is  closed  and  bounded,  and  therefore  compact. 


66    Chapter  2  Differential  Calculus  of  Functions  of  One  Variable 


Theorem  2.2.12  Iff  is  continuous  on  a  closed  and  bounded  interval  [a,  b],  then  f 
is  uniformly  continuous  on  \a,b\. 

Proof  Suppose  that  e  >  0.  Since  /  is  continuous  on  [a,b],  for  each  t  in  [a,  b]  there  is 
a  positive  number  8t  such  that 

|/(jc)-/(0I  <  -    if    \x-t\<2St    and    xe[a,b].  (2.2.9) 
If  /,  =  (t  -  St,t  +  St),  the  collection 

M  =  {I,  \t  e  [a,b]} 

is  an  open  covering  of  [a,  b].  Since  [a,  b]  is  compact,  the  Heine-Borel  theorem  implies  that 
there  are  finitely  many  points  t\,  t%,  . . . ,  tn  in  [a,  b]  such  that  Itl ,  It2,  ■  ■  ■ ,  hn  cover  [a,  b]. 
Now  define 

8  =  mm{Stl,St2,...,Stn}.  (2.2.10) 

We  will  show  that  if 

\x-x'\<8    and    x,  x'  e  [a,  b],  (2.2.11) 

then  | /(*)-/(*')!  <€. 
From  the  triangle  inequality, 

[/(*)  -  f(x')\  =  |  (f(x)  -  f(tr))  +  (f(tr)  -  f(x'))  | 

<  \f{x)  -  f(fr)\  +  \f(tr)  ~  f(x%  {Z-ZAZ> 

Since  Itl ,  It2, . . . ,  It„  cover  [a,  b],  x  must  be  in  one  of  these  intervals.  Suppose  that  x  €  Itr ; 
that  is, 

\x-tr\<8tr.  (2.2.13) 

From  (2.2.9)  with  t  =  tr, 

\f{x)-f(tr)\  <  e-.  (2.2.14) 
From  (2.2.1 1),  (2.2.13),  and  the  triangle  inquality, 

\x'-tT\  =  \(x' -  x)  +  (x  -  tr)\  <  \x'-x\  +  \x-tr\  <  8  +  8tr  <28tr. 
Therefore,  (2.2.9)  with  t  =  tr  and  x  replaced  by  x'  implies  that 

\f(x')-f(tr)\<e-. 

This,  (2.2.12),  and  (2.2.14)  imply  that  \f(x)  -  f(x')\  <e.  U 

This  proof  again  shows  the  utility  of  the  Heine-Borel  theorem,  which  allowed  us  to 
define  8  in  (2.2.10)  as  the  smallest  of  a  finite  set  of  positive  numbers,  so  that  8  is  sure  to  be 
positive.  (An  infinite  set  of  positive  numbers  may  fail  to  have  a  smallest  positive  member; 
for  example,  consider  the  open  interval  (0,  1).) 

Corollary  2.2.13  If  f  is  continuous  on  a  set  T,  then  f  is  uniformly  continuous  on 
any  finite  closed  interval  contained  in  T. 
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Applied  to  Example  2.2.16,  Corollary  2.2.13  implies  that  the  function  g(x)  =  cos  \/x 
is  uniformly  continuous  on  [p,  1]  if  0  <  p  <  1. 

More  About  Monotonic  Functions 

Theorem  2.1.9  implies  that  if  /  is  monotonic  on  an  interval  /,  then  /  is  either  continuous 
or  has  a  jump  discontinuity  at  each  .To  in  /.  This  and  Theorem  2.2.10  provide  the  key  to 
the  proof  of  the  following  theorem. 

Theorem  2.2.14  If  f  is  monotonic  and  nonconstant  on  [a, b], then  f  is  continuous  on 
[a,  b]  if  and  only  if  its  range  R  f  =  {/ (x)  I  X  e  [a,  b]}  is  the  closed  interval  with  endpoints 
f  (a)  and  f  iff). 

Proof  We  assume  that  /  is  nondecreasing,  and  leave  the  case  where  /  is  nonincreasing 
to  you  (Exercise  2.2.34).  Theorem  2.1.9(a)  implies  that  the  set  Rf  =  {fix)  |  x  e  (a,  b)} 
is  a  subset  of  the  open  interval  (/ (a+),  / (b— )).  Therefore, 

Rf  =  {f(a)}  URfU  {fib)}  c  {/(a)}  U  (f(a+),  f{b-))  U  {fib)}.  (2.2.15) 

Now  suppose  that  /  is  continuous  on  [a,b].  Then  f(a)  =  f(a+),  fib—)  =  fib),  so 
(2.2.15)  implies  that  Rf  c  [fia),  fib)].  If  /(a)  <  pi  <  fib),  then  Theorem  2.2.10 
implies  that  pi  =  fix)  for  some  x  in  ia,b).  Hence,  Rf  =  [fia),f  (b)]. 

For  the  converse,  suppose  that  Rf  =  [f  (a),  / (£)],  Since  /(a)  <  / (a+)  and  / ib— )  < 
fib),  (2.2.15)  implies  that  fia)  =  f(a+)  and  fib-)  =  fib).  We  know  from  Theo- 
rem 2.1.9(c)  that  if  /  is  nondecreasing  and  a  <  xq  <  b,  then 

/(xo-)  <  /(•*<>)  <  /(xo+). 

If  either  of  these  inequalities  is  strict,  R f  cannot  be  an  interval.  Since  this  contradicts 
our  assumption,  /(xo— )  =  /(xo)  =  /(xo+).  Therefore,  /  is  continuous  at  xq  (Exer- 
cise 2.2.2).  We  can  now  conclude  that  /  is  continuous  on  [a,  b\.  H 

Theorem  2.2.14  implies  the  following  theorem. 

Theorem  2.2.15  Suppose  that  f  is  increasing  and  continuous  on[a,b],  and  let  fia) 
•  and  f  ib)  =  d.  Then  there  is  a  unique  function  g  defined  on  [c,  d]  such  that 

g(f(x))=x,    a<x<b,  (2.2.16) 

and 

figiy))  =y,    c<y<d.  (2.2.17) 
Moreover,  g  is  continuous  and  increasing  on  [c,d\. 

Proof  We  first  show  that  there  is  a  function  g  satisfying  (2.2.16)  and  (2.2.17).  Since  / 
is  continuous,  Theorem  2.2.14  implies  that  for  each  yo  in  [c.  d]  there  is  an  xq  in  [a,  b]  such 
that 

/(xo)  =  yo,  (2.2.18) 
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and,  since  /  is  increasing,  there  is  only  one  such  xq.  Define 

g(yo)  =  x0.  (2.2.19) 

The  definition  of  xq  is  illustrated  in  Figure  2.2.6:  with  [c,  d]  drawn  on  the  y-axis,  find  the 
intersection  of  the  line  y  =  yo  with  the  curve  y  =  f  (x)  and  drop  a  vertical  from  the 
intersection  to  the  .r-axis  to  find  xq. 


Substituting  (2.2.19)  into  (2.2.18)  yields 

f(g(yo))  =  yo, 

and  substituting  (2.2. 18)  into  (2.2.19)  yields 

g(f(*o))  =  x0. 

Dropping  the  subscripts  in  these  two  equations  yields  (2.2.16)  and  (2.2.17). 

The  uniqueness  of  g  follows  from  our  assumption  that  /  is  increasing,  and  therefore 
only  one  value  of  xq  can  satisfy  (2.2.18)  for  each  yo. 

To  see  that  g  is  increasing,  suppose  that  y\  <  y%  and  let  x\  and  x2  be  the  points  in  [a,  b] 
such  that  f(x\)  =  y\  and  fix-2)  =  yi-  Since  /  is  increasing,  x\  <  x-i-  Therefore, 

g(yi)  =  xi  <x2  =  g(y2), 

so  g  is  increasing.  Since  Rg  =  {g(y)  |  y  €  [c,  d]}  is  the  interval  [g(c),  g(d)]  =  [a,  b], 
Theorem  2.2.14  with  /  and  [a,  b]  replaced  by  g  and  [c,  d]  implies  that  g  is  continuous  on 
[c,d\.  TJ 

The  function  g  of  Theorem  2.2.15  is  the  inverse  of  /,  denoted  by  Since  (2.2.16) 
and  (2.2.17)  are  symmetric  in  /  and  g,  we  can  also  regard  /  as  the  inverse  of  g,  and  denote 
it  by  g~l. 
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Example  2.2.17  If 

fix)  =  x2,    0  <  x  <  R, 

then 

f~x(y)  =  g(y)  =  *Jy,  o<y<R2. 

Example  2.2.18  If 

/(x)  =  2jc  +  4,  0<jc<2, 

then 

r\y)  =  g{y)  =  1^-,  4<y<8. 

2.2  Exercises 


1.  Prove  Theorem  2.2.2. 

2.  Prove  that  a  function  /  is  continuous  at  xq  if  and  only  if 

lim   f(x)  =    lim   f(x)  =  f(x0). 

x->-xo—  X^-XQ  + 

3.  Determine  whether  /  is  continuous  or  discontinuous  from  the  right  or  left  at  xq. 

(a)  f{x)  =  ^    (x0  =  0)     (b)  f{x)  =         ix0  >  0) 
(c)  fix)  =  —    ix0  =  0)       (d)  / ix)  =  x2    ix0  arbitrary) 

,  N  (x  sin  l/x,  i/O, 

(e)  /(x)  =  J1  (x0  =  0) 

(x  sin  l/x,    x  ^  0 

(f)  /(x)=jQ  x  =  ()      (x0  =  0) 

f*  +  |jc|(l  +  jc)  .  1 

(g)  /(*)  =  ]  ;  sin.'  ^°     (^0  =  0) 

ll,  x  =  0 

4.  Let  /  be  defined  on  [0,  2]  by 

{x2,        0  <  x  <  1, 


/(.) 


'x  +  1,    1  <  x  <  2. 


On  which  of  the  following  intervals  is  /  continuous  according  to  Definition  2.2.3: 
[0, 1),  (0, 1),  (0, 1],  [0, 1],  [1,2),  (1,2),  (1,2],  [1,2]? 

5.  Let 

g(x)  =   -■ 

X  —  1 

On  which  of  the  following  intervals  is  g  continuous  according  to  Definition  2.2.3: 
[0,  1),  (0,  1),  (0,  1],  [l,oo),  (Loo)? 
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6.  Let 

-1    if  x  is  irrational, 
1    if  x  is  rational. 
Show  that  /  is  not  continuous  anywhere. 

7.  Let  f(x)  =  0  if  x  is  irrational  and  f(p/q)  =  l/q  if  p  and  q  are  positive  inte- 
gers with  no  common  factors.  Show  that  /  is  discontinuous  at  every  rational  and 
continuous  at  every  irrational  on  (0,  oo). 

8.  Prove:  If  /  assumes  only  finitely  many  values,  then  /  is  continuous  at  a  point  xq  in 
D°j  if  and  only  if  /  is  constant  on  some  interval  (xq  —  S,xq  +  S). 

9.  The  characteristic  function  i/rr  of  a  set  T  is  defined  by 

1,  ieT, 
0,    x  #  T. 

Show  that  \jfT  is  continuous  at  a  point  xq  if  and  only  if  xq  e  T°  U  (rc)°. 

10.  Prove:  If  /  and  g  are  continuous  on  (a,  b)  and  / (x)  =  g(x)  for  every  x  in  a  dense 
subset  (Definition  1.1.5)  of  {a,  b),  then  f{x)  =  g(x)  for  all  x  in  (a,  b). 

11.  Prove  that  the  function  g(x)  =  \ogx  is  continuous  on  (0,  oo).  Take  the  following 
properties  as  given. 

(a)  lim^i  g(x)  =  0. 

(b)  g{xi)  +  g(x2)  =  g(x\x2)  ifxux2  >  0. 

12.  Prove  that  the  function  / (x)  =  eax  is  continuous  on  (— oo,  oo).  Take  the  following 
properties  as  given. 

(a)  linwo  /(*)  =  1. 

(b)  f(xi+x2)=  f(xi)f(x2),    -oo  <  X\,Xz  <  oo. 

13.  (a)  Prove  that  the  functions  sinhx  and  coshx  are  continuous  for  all  x. 
(b)  For  what  values  of  x  are  tanhx  and  cothx  continuous? 

14.  Prove  that  the  functions  s  (x)  =  sinx  andc(x)  =  cos*  are  continuous  on  (— oo,  oo). 
Take  the  following  properties  as  given. 

(a)  linwo  c(x)  =  1. 

(b)  c(x\  —  x2)  =  c{x\)c(x2)  +  s(xi)s(x2),    —  oo  <  x\,x%  <  oo. 

(c)  s2(x)  +  c2(x)  =  1,    — oo  <  x  <  oo. 

15.  (a)   Prove:  If  /  is  continuous  at  xq  and  f(xo)  >  fi,  then  f(x)  >  ji  for  all  x  in 

some  neighborhood  of  xq  . 

(b)  State  a  result  analogous  to  (a)  for  the  case  where  / (xq)  <  pu. 

(c)  Prove:  If  / (x)  <  fi  for  all  x  in  S  and  xq  is  a  limit  point  of  S  at  which  /  is 
continuous,  then  f(xo)  <  pt. 

(d)  State  results  analogous  to  (a),  (b),  and  (c)  for  the  case  where  /  is  contin- 
uous from  the  right  or  left  at  xq. 
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16.  Let  |/|  be  the  function  whose  value  at  each  x  in  Df  is  |/(x)|.  Prove:  If  /  is 
continuous  at  xq,  then  so  is  \f\.  Is  the  converse  true? 

17.  Prove:  If  /  is  monotonic  on  [a,  b],  then  /  is  piecewise  continuous  on  [a,  b]  if  and 
only  if  /  has  only  finitely  many  discontinuities  in  [a,  b]. 

1 8 .  Prove  Theorem  2.2.5. 

19.  (a)    Show  that  if  fa,  fa,       fn  are  continuous  on  a  set  S  then  so  are  fa  +  fa  + 

•  ••  +  /„  and  fa  fa  ■■■fa. 
(b)    Use  (a)  to  show  that  a  rational  function  is  continuous  for  all  values  of  x 
except  the  zeros  of  its  denominator. 

20.  (a)   Let  fa  and  fa  be  continuous  at  .To  and  define 

F(x)  =  max  (fa(x),  fa(x))  . 

Show  that  F  is  continuous  at  xq. 

(b)  Let  fa,  fa,  ■  ■  ■ ,  fn  be  continuous  at  xo  and  define 

F(x)  =  max  (fa(x),  fa(x), f„(x)) . 

Show  that  F  is  continuous  at  xq. 

2 1 .  Find  the  domains  of  /  o  g  and  g  o  / . 

(a)  f(x)  =  */x,    g(x)  =  1  -  x2         (b)  f(x)  =  logx,    g(x)  =  sinx 

(c)  f(x)  =     1         g(x)  =  cosx      (d)/(x)  =  V^.    ^(x)  =  sin2x 

L  —  X^ 

22.  (a)    Suppose  that  yo  =  lim*-^  g(x)  exists  and  is  an  interior  point  of  D  f,  and 

that  /  is  continuous  at  yo.  Show  that 

lim  (/  o  g){x)  =  /(y0). 

(b)  State  an  analogous  result  for  limits  from  the  right. 

(c)  State  an  analogous  result  for  limits  from  the  left. 

23.  Use  Theorem  2.2.7  to  find  all  points  xq  at  which  the  following  functions  are  contin- 
uous. 

(a)  Vl  -  x2  (b)  sine-*2  (c)  log(l  +  sinx) 

(d)  (e)sm^  <f)-(^) 

(g)  (l-sin2;c)~1/2  (h)  cot(l  -  e~x2)  (i)  cos  - 

24.  Complete  the  proof  of  Theorem  2.2.9  by  showing  that  there  is  an  X2  such  that 
f(x2)  =  P- 
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25.  Prove:  If  /  is  nonconstant  and  continuous  on  an  interval  /,  then  the  set  S  = 
{y  |  y  =  f  (x)'  -X  e  /}  is  an  interval.  Moreover,  if  /  is  a  finite  closed  interval,  then 
so  is  S. 

26.  Suppose  that  /  and  g  are  defined  on  (— oo,  oo),  /  is  increasing,  and  /  o  g  is  con- 
tinuous on  (—oo,  oo).  Show  that  g  is  continuous  on  (— oo,  oo). 

27.  Let  /  be  continuous  on  [a,  b),  and  define 

F(x)  =  max  /(f),    a  <  x  <  b. 

a<t<x 

(How  do  we  know  that  F  is  well  defined?)  Show  that  F  is  continuous  on  [a,  b). 

28.  Let  /  and  g  be  uniformly  continuous  on  an  interval  S. 

(a)  Show  that  f  +  g  and  /  —  g  are  uniformly  continuous  on  S. 

(b)  Show  that  fg  is  uniformly  continuous  on  S  if  S  is  compact. 

(c)  Show  that  f/g  is  uniformly  continuous  on  S  if  S  is  compact  and  g  has  no 
zeros  in  S . 

(d)  Give  examples  showing  that  the  conclusion  of  (b)  and  (c)  may  fail  to  hold 
if  S  is  not  compact. 

(e)  State  additional  conditions  on  /  and  g  which  guarantee  that  fg  is  uniformly 
continuous  on  S  even  if  S  is  not  compact.  Do  the  same  for  / j g, 

29.  Suppose  that  /  is  uniformly  continuous  on  a  set  S,  g  is  uniformly  continuous  on  a 
set  T,  and  g(x)  e  S  for  every  x  in  T.  Show  that  /  o  g  is  uniformly  continuous  on 
T. 

30.  (a)   Prove:  If  /  is  uniformly  continuous  on  disjoint  closed  intervals  I\,  I2,  . . . , 

/„,  then  /  is  uniformly  continuous  on  lj/=i  Ij- 
(b)    Is  (a)  valid  without  the  word  "closed"? 

31.  (a)   Prove:  If  /  is  uniformly  continuous  on  a  bounded  open  interval  (a,  b),  then 

f(a+)  and  f(b-)  exist  and  are  finite.  Hint:  See  Exercise  2.1.38. 
(b)    Show  that  the  conclusion  in  (a)  does  not  follow  if  (a,  b)  is  unbounded. 

32.  Prove:  If  /  is  continuous  on  [a,  00)  and  /(oo)  exists  (finite),  then  /  is  uniformly 
continuous  on  [a,  00). 

33.  Suppose  that  /  is  defined  on  (—00,  00)  and  has  the  following  properties. 

(i)  lim  f(x)  =  1  and  (ii)  f(xi+x2)  =  f(xi)f(x2),  -00  <Xi,X2<oo. 
Prove: 

(a)  f{x)  >  Oforalljc. 

(b)  / irx)  =  [f  ix)]r  if  r  is  rational. 

(c)  If  /(l)  =  1  then  /  is  constant. 
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(d)    If  f{\)  =  p  >  1,  then  /  is  increasing, 

lim  / (x)  =  oo,    and       lim   / (x)  =  0. 

x— too  X  — oo 

(Thus,  / (x)  =  eax  has  these  properties  if  a  >  0.) 
HINT:  See  Exercises  2.2.10  and  2.2.12. 
34.     Prove  Theorem  2.2.14  in  the  case  where  /  is  nonincreasing. 


2.3  DIFFERENTIABLE  FUNCTIONS  OF  ONE  VARIABLE 


In  calculus  you  studied  differentiation,  emphasizing  rules  for  calculating  derivatives.  Here 
we  consider  the  theoretical  properties  of  differentiable  functions.  In  doing  this,  we  assume 
that  you  know  how  to  differentiate  elementary  functions  such  as  x",  ex,  and  sinx,  and  we 
will  use  such  functions  in  examples. 


Definition  of  the  Derivative 


Definition  2.3.1  A  function  /  is  differentiable  at  an  interior  point  xq  of  its  domain  if 
the  difference  quotient 

/(*)  -  /(JC0)  , 

 ,      X  Xq, 

X  —  Xq 

approaches  a  limit  as  x  approaches  xq,  in  which  case  the  limit  is  called  the  derivative  of  f 
at  xq,  and  is  denoted  by  f'(xo);  thus, 

ft    \      v  f(x)~f(xo) 

/  (xo)  =  hm   .  (2.3.1) 

X-+XQ        X  —  Xq 

It  is  sometimes  convenient  to  let  x  =  xq  +  h  and  write  (2.3.1)  as 

fll    %     ,.     f(xo  +h)-  f(x0) 

f  (x0)  =  hm  .  ■ 

/i-s-o  h 

If  /  is  defined  on  an  open  set  5,  we  say  that  /  is  differentiable  on  S  if  /  is  differentiable 
at  every  point  of  S.  If  /  is  differentiable  on  5,  then  /'  is  a  function  on  S.  We  say  that 
/  is  continuously  differentiable  on  S  if  /'  is  continuous  on  5.  If  /  is  differentiable  on  a 
neighborhood  of  xq,  it  is  reasonable  to  ask  if  /'  is  differentiable  at  xq.  If  so,  we  denote  the 
derivative  of  /'  at  xq  by  f"(xo).  This  is  the  second  derivative  of  f  at  xq,  and  it  is  also 
denoted  by  f^2\xo).  Continuing  inductively,  if  is  defined  on  a  neighborhood  of 

xq,  then  the  nth  derivative  of  f  at  xq,  denoted  by  f^n'(xo),  is  the  derivative  of  Z*-"-1^  at 
xq.  For  convenience  we  define  the  zeroth  derivative  of  /  to  be  /  itself;  thus 

f(0)  =  f- 

We  assume  that  you  are  familiar  with  the  other  standard  notations  for  derivatives;  for 
example, 

fi2)  =  f",     f0)  =  f". 
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and  so  on,  and 

d_lL  =  fin) 

dx" 


Example  2.3.1  If  n  is  a  positive  integer  and 

f(x)  =  x", 

then 


/(■*)  ~  f(xo)  _  x"  —  xo  _  x  —  x0  -i  -    n-k-l  k 

—  —  /      X  Xn  , 

X  —  Xo  X  —  Xo         X  —  Xo 

k=0 


/'(xo)  =  lim  Yxn~k~l4  =  nxTx. 

X—>-XQ  ^— ' 

k=0 

Since  this  holds  for  every  Xo,  we  drop  the  subscript  and  write 


f(x)  =  nx" 


or 


—  ixn)  =  nx* 
dx 


To  derive  differentiation  formulas  for  elementary  functions  such  as  sinx,  cosx,  and  ex 
directly  from  Definition  2.3.1  requires  estimates  based  on  the  properties  of  these  functions. 
Since  this  is  done  in  calculus,  we  will  not  repeat  it  here. 


Interpretations  of  the  Derivative 

If  / ix)  is  the  position  of  a  particle  at  time  x  ^  xq,  the  difference  quotient 

fix)  -  fjxp) 

X  -  Xo 

is  the  average  velocity  of  the  particle  between  times  xq  and  x.  As  x  approaches  xq,  the 
average  applies  to  shorter  and  shorter  intervals.  Therefore,  it  makes  sense  to  regard  the  limit 
(2.3.1),  if  it  exists,  as  the  particle's  instantaneous  velocity  at  time  Xo-  This  interpretation 
may  be  useful  even  if  x  is  not  time,  so  we  often  regard  f'ixo)  as  the  instantaneous  rate  of 
change  of  f  (x)  at  Xq,  regardless  of  the  specific  nature  of  the  variable  x.  The  derivative  also 
has  a  geometric  interpretation.  The  equation  of  the  line  through  two  points  (xo,  / (xo))  and 
(xi,  /(xi))  on  the  curve  y  =  fix)  (Figure  2.3.1)  is 

y  =  Jixo)  H  (x  -  x0). 

xi  -x0 

Varying  x\  generates  lines  through  (xq,  / (xq))  that  rotate  into  the  line 


y  =  f(xo)  +  f'ixo)ix  -  x0) 


(2.3.2) 
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as  x\  approaches  xq.  This  is  the  tangent  to  the  curve  y  =  fix)  at  the  point  (xq,  f(xo)). 
Figure  2.3.2  depicts  the  situation  for  various  values  of  x\ . 


Figure  2.3.1 


Figure  2.3.2 

Here  is  a  less  intuitive  definition  of  the  tangent  line:  If  the  function 

T(x)  =  /(x0)  +  m(x  -  x0) 
approximates  /  so  well  near  Xq  that 

,.  m  -  t(X) 

hm    =  0, 

x^f-xo       x  —  Xo 

we  say  that  the  line  y  =  T(x)  is  tangent  to  the  curve  y  =  f  (x)  at  (xq  ,  / (xo)). 
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This  tangent  line  exists  if  and  only  if  f'(xo)  exists,  in  which  case  m  is  uniquely  determined 
by  m  =  f'(xo)  (Exercise  2.3.1).  Thus,  (2.3.2)  is  the  equation  of  the  tangent  line. 

We  will  use  the  following  lemma  to  study  differentiable  functions. 
Lemma  2.3.2  If  f  is  differentiable  at  xq,  then 

f{x)  =  /(jc0)  +  [/'(jc0)  +  E(x)](x  -  jc0),  (2.3.3) 
where  E  is  defined  on  a  neighborhood  of  Xq  and 

lim  E(x)  =  E{xq)  =  0. 

X  — >XQ 

Proof  Define 

(f(x)-f(x0) 

E(x)  =  \      x-X0      -f{Xo)'  (2.3.4) 

|0,  X  =  Xq. 

Solving  (2.3.4)  for  / (x)  yields  (2.3.3)  if  x  ^  xq,  and  (2.3.3)  is  obvious  if  x  =  xq.  Defini- 
tion 2.3.1  implies  that  limx^xo  E{x)  =  0.  We  defined  E{xq)  =  0  to  make  E  continuous 
at.T0.  TJ 
Since  the  right  side  of  (2.3.3)  is  continuous  at  xq,  so  is  the  left.  This  yields  the  following 
theorem. 

Theorem  2.3.3  Iff  is  differentiable  at  xq,  then  f  is  continuous  at  xq. 

The  converse  of  this  theorem  is  false,  since  a  function  may  be  continuous  at  a  point 
without  being  differentiable  at  the  point. 

Example  2.3.2  The  function 

fix)  =  \x\ 


can  be  written  as 


or  as 


From  (2.3.5), 


and  from  (2.3.6), 


f(x)  =    x,  x>0,  (2.3.5) 

f(x)  =  -x,  x  <  0.  (2.3.6) 

/'(*)  =    1,  x  >  0, 

f'(x)  =  -1,  x  <  0. 


Neither  (2.3.5)  nor  (2.3.6)  holds  throughout  any  neighborhood  of  0,  so  neither  can  be  used 
alone  to  calculate  /'(0).  In  fact,  since  the  one-sided  limits 

lim   =  hm  —  (2.3.7) 

x-i-0+       x  —  0  x->-0+  x 


and 


fix)  -  /(0)  -x 
lim  LLl — 1±±  =  Hm  —  =  _i  (2.3.8) 

x^-0-        X  —  0  X-*Q—  X 
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are  different, 

hm  

x->0       x  —  0 

does  not  exist  (Theorem  2.1.6);  thus,  /  is  not  differentiable  at  0,  even  though  it  is  continu- 
ous at  0. 


Interchanging  Differentiation  and  Arithmetic  Operations 

The  following  theorem  should  be  familiar  from  calculus. 

Theorem  2.3.4  If  f  and  g  are  differentiable  at  xq,  then  so  are  f  +  g,  f  —  g,  and  fg, 
with 

(a)  (/  +  s)'(xo)  =  /'(*<>) +  g,(x„); 

(b)  (/-g)'(xo)  =  /'(xo)-g(x0); 

(c)  (fg)'(x0)  =  f'(xo)g(x0)  +  /(xo)g'(xo). 

The  quotient  f/g  is  differentiable  at  Xq  ifg(xo)  ^  0,  with 


/  Y  ,    ,     /'(xo)g(xo)  -  /(xo)g'(xo) 
(xo)  =  


Proof  The  proof  is  accomplished  by  forming  the  appropriate  difference  quotients  and 
applying  Definition  2.3.1  and  Theorem  2.1.4.  We  will  prove  (c)  and  leave  the  rest  to  you 
(Exercises  2.3.9,  2.3.10,  and  2.3.11). 

The  trick  is  to  add  and  subtract  the  right  quantity  in  the  numerator  of  the  difference 
quotient  for  (  fg)'  (xq);  thus, 

f(x)g{x)  -  f(x0)g(x0)  _  f(x)g(x)  -  f(x0)g(x)  +  f(x0)g(x)  -  f(x0)g(x0) 

X  —  Xo  X  —  Xq 

/(x)-/(x0)     .  ^(x)-g(xo) 
=  g(x)  +  /(x0)  . 

X  —  Xq  X  —  Xq 

The  difference  quotients  on  the  right  approach  f'(xo)  and  g'Oto)  as  x  approaches  xq,  and 
limx_>xo  g(x)  =  g(xo)  (Theorem  2.3.3).  This  proves  (c).  TJ 

The  Chain  Rule 

Here  is  the  rule  for  differentiating  a  composite  function. 

Theorem  2.3.5  (The  Chain  Rule)  Suppose  that  g  is  differentiable  at  xq  and  f 
is  differentiable  at  g(xo).  Then  the  composite  function  h  =  f  o  g,  defined  by 

h(x)  =  f{g(x)), 

is  differentiable  at  Xq,  with 

h'(x0)  =  f'(g(x0))g'(x0). 
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Proof    Since  /  is  differentiable  at  g(xo),  Lemma  2.3.2  implies  that 
fit)  ~  f(g(xo))  =  [f'(g(xo))  +  E(t))[t  -g(x0)l 

where 

lim    E(t)  =  E(g(x0))  =  0.  (2.3.9) 

t^-g(.xo) 

Letting  t  =  g(x)  yields 

f(g(x))  -  f(g(x0))  =  [f'(g(x0))  +  E(g(x))][g(x)  -  g(x0)]. 
Since  h(x)  =  f  (g(x)),  this  implies  that 

 =  [/  te(*o))  +  #&•(*))]  •  (2.3.10) 

X  —  Xo  X  —  Xq 

Since  g  is  continuous  at  xo  (Theorem  2.3.3),  (2.3.9)  and  Theorem  2.2.7  imply  that 

lim  E(g(x))  =  E(g(x0))  =  0. 

X^-XQ 

Therefore,  (2.3.10)  implies  that 

u'(    \      v     h(x)-h(x0)        ,  , 

h  (x0)  =  lim   =  /  (g(x0))g  (x0), 

x^xq        X  —  Xq 

as  stated.  H 
Example  2.3.3  If 

f(x)  =  sinx    and    g(x)  =  — .    i  /  0, 

x 

then 

h(x)  =  f(g(x))  =  sin  -,  i/O, 
x 

and 

h'(x)  =  f'(g(x))g(x)  =  (cos  1  j  j  ,    x  ^  0.  ■ 

It  may  seem  reasonable  to  justify  the  chain  rule  by  writing 
h(x)-h(x0)  _  f(g(x))-f(g(x0)) 

X  —  Xo  X  —  Xq 

_  f(g(x))-f(g(x0))  g(x)-g(x0) 
g(x)-g(x0)  x-x0 

and  arguing  that 


lim   —  —        =  /  (g(x0)) 

x^xq        g(x)  -  g(x0) 
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(because  limx->XQ  g(x)  =  g(x0))  and 

hm   =  g  (x0). 

x^xq        x  —  Xq 

However,  this  is  not  a  valid  proof  (Exercise  2.3.13). 
One-Sided  Derivatives 

One-sided  limits  of  difference  quotients  such  as  (2.3.7)  and  (2.3.8)  in  Example  2.3.2  are 
called  one-sided  or  right-  and  left-hand  derivatives.  That  is,  if  /  is  defined  on  [xo,  b),  the 
right-hand  derivative  of  f  at  xq  is  defined  to  be 

w,     s  ,.  f(x)-f(x0) 

f+(xo)  =  hm   

x->xo+        X  —  Xq 

if  the  limit  exists,  while  if  /  is  defined  on  {a,  jco],  the  left-hand  derivative  of  f  at  xq  is 
defined  to  be 

f-(xo)  =  hm   

x-^xq-        X  —  Xo 

if  the  limit  exists.  Theorem  2.1.6  implies  that  /  is  differentiable  at  xq  if  and  only  if  /f  (xo) 
and  fl(xo)  exist  and  are  equal,  in  which  case 

/'(x0)  =  fi(x0)  =  fi(x0). 
In  Example  2.3.2,  /|(0)  =  1  and  fl(0)  =  -1. 

Example  2.3.4  If 


3,  jc  <  0, 


/(x)=  \  i  (2.3.11) 

I  x  sin  — ,    x  >  0, 

x 

then 

{3x2,  x  <  0, 

1  1  (2.3.12) 

2x  sin  cos—,    x  >  0. 
x  x 

Since  neither  formula  in  (2.3.1 1)  holds  for  all  x  in  any  neighborhood  of  0,  we  cannot  simply 
differentiate  either  to  obtain  /'(0);  instead,  we  calculate 

9  1 

x  sin  0  j 

/X(0)  =   lim   ^         =   lim  xsin-  =  0. 

^  x^0+        X  -  0  x^0+  X 

x3  —  0 

f!_(G)  =  Hm   =  lim  x2  =  0; 

X-M)-  X  —  0  x->0- 

hence,  /'(0)  =  /' (0)  =  /Z(0)  =  0.  ■ 
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This  example  shows  that  there  is  a  difference  between  a  one-sided  derivative  and  a  one- 
sided limit  of  a  derivative,  since  fl(0)  =  0,  but,  from  (2.3.12),  /'(0+)  =  limx^o+  f'(x) 
does  not  exist.  It  also  shows  that  a  derivative  may  exist  in  a  neighborhood  of  a  point  xq 
(=  0  in  this  case),  but  be  discontinuous  at  .To. 

Exercise  2.3.4  justifies  the  method  used  in  Example  2.3.4  to  compute  f'(x)  for  x  ^  0. 
Definition  2.3.6 

(a)  We  say  that  /  is  differentiable  on  the  closed  interval  [a,  b]  if  /  is  differentiable  on 
the  open  interval  (a,  b)  and  /1(a)  and  fl(b)  both  exist. 

(b)  We  say  that  /  is  continuously  differentiable  on  [a,  b]  if  /  is  differentiable  on  [a,b], 
f  is  continuous  on  (a,b),  f\_(a)  =  f'(a+),  and  f!_(b)  =  fib—). 


Extreme  Values 

We  say  that  / (xo)  is  a  local  extreme  value  of  /  if  there  is  a  S  >  0  such  that  / (x)  —  / (xo) 
does  not  change  sign  on 

(xo-S,x0  +  S)nDf.  (2.3.13) 
More  specifically,  / (xo)  is  a  local  maximum  value  of  /  if 

/(*)  <  /(*o)  (23.14) 

or  a  local  minimum  value  of  /  if 

/(*)  >  /(*o)  (23.15) 

for  all  x  in  the  set  (2.3.13).  The  point  xq  is  called  a  local  extreme  point  of  /,  or,  more 
specifically,  a  local  maximum  or  /oca/  minimum  point  of  /. 

A 


Figure  2.3.3 
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Example  2.3.5  If 

1,  -\<x<-\ 
/U)  J       1*1.  'J  <  v  < 


—  2' 


1  JtX  , 

— =  sin — .      4  <  x  <  4 

(Figure  2.3.3),  then  0,  3,  and  every  x  in  (—1,  —  A)  are  local  minimum  points  of  /,  while  1, 
4,  and  every  x  in  (—  1,  — ~]  are  local  maximum  points.  ■ 

It  is  geometrically  plausible  that  if  the  curve  y  =  f  (x)  has  a  tangent  at  a  local  extreme 
point  of  /,  then  the  tangent  must  be  horizontal;  that  is,  have  zero  slope.  (For  example,  in 
Figure  2.3.3,  see  x  =  l,x  =  3,  and  every  x  in  (— 1,  —1/2).)  The  following  theorem  shows 
that  this  must  be  so. 

Theorem  2.3.7  Iff  is  differentiable  at  a  local  extreme  point  xq  e  D°j- ,  then  f '  {xq)  =  0 

Proof   We  will  show  that  xo  is  not  a  local  extreme  point  of  /  if  f'(xo)  ^  0.  From 
Lemma  2.3.2, 

f(X)  -  f(X0)  =  f'(x0)  +  E(x),  (2.3.16) 

X  —  Xo 

where  limjt;^xo  E{x)  =  0.  Therefore,  if  f'(xo)  ^  0,  there  is  a  S  >  0  such  that 

\E(x)\  <\f'(x0)\    if  \x-x0\<8, 

and  the  right  side  of  (2.3.16)  must  have  the  same  sign  as  f'(xo)  for  \x  —  xq\  <  8.  Since 
the  same  is  true  of  the  left  side,  / (x)  —  / (xo)  must  change  sign  in  every  neighborhood  of 
xo  (since  x  —  Xo  does).  Therefore,  neither  (2.3.14)  nor  (2.3.15)  can  hold  for  all  x  in  any 
interval  about  xo .  H 

If  f'{xo)  =  0,  we  say  that  Xo  is  a  critical  point  of  /.  Theorem  2.3.7  says  that  every 
local  extreme  point  of  /  at  which  /  is  differentiable  is  a  critical  point  of  / .  The  converse 
is  false.  For  example,  0  is  a  critical  point  of  / (x)  =  x3,  but  not  a  local  extreme  point. 


Rolle's  Theorem 

The  use  of  Theorem  2.3.7  for  finding  local  extreme  points  is  covered  in  calculus,  so  we  will 
not  pursue  it  here.  However,  we  will  use  Theorem  2.3.7  to  prove  the  following  fundamental 
theorem,  which  says  that  if  a  curve  y  =  f  (x)  intersects  a  horizontal  line  at  x  =  a  and 
x  =  b  and  has  a  tangent  at  (x,  / (x))  for  every  x  in  (a,  b),  then  there  is  a  point  c  in  (a,  b) 
such  that  the  tangent  to  the  curve  at  (c,  / (c))  is  horizontal  (Figure  2.3.4). 
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y 
i 


Figure  2.3.4 

Theorem  2.3.8  (Rolle's  Theorem)  Suppose  that  f  is  continuous  on  the  closed 
interval  [a,b]  and  differentiable  on  the  open  interval  (a,b),  and  f  (a)  =  f(b).  Then 
f'(c)  =  0  for  some  c  in  the  open  interval  {a,  b). 

Proof  Since  /  is  continuous  on  [a,  b],  f  attains  a  maximum  and  a  minimum  value  on 
[a,  b]  (Theorem  2.2.9).  If  these  two  extreme  values  are  the  same,  then  /  is  constant  on 
(a,  b),  so  f'(x)  =  0  for  all  x  in  (a,  b).  If  the  extreme  values  differ,  then  at  least  one  must 
be  attained  at  some  point  c  in  the  open  interval  {a,  b),  and  f'(c)  =  0,  by  Theorem  2.3.7. 
U 

Intermediate  Values  of  Derivatives 

A  derivative  may  exist  on  an  interval  [a,  b]  without  being  continuous  on  [a,b].  Neverthe- 
less, an  intermediate  value  theorem  similar  to  Theorem  2.2.10  applies  to  derivatives. 

Theorem  2.3.9  (Intermediate  Value  Theorem  for  Derivatives)  Suppose 
that  f  is  differentiable  on  [a,b],  f  (a)  7^  f'(b),  and  [i  is  between  f'{a)  and  f '(b).  Then 
f'ifi)  =  \i  for  some  c  in  (a,  b). 

Proof    Suppose  first  that 

f'(a)  <  fi  <  f'(b)  (2.3.17) 

and  define 

g(x)  =  f(x)-  fix. 

Then 

g'(x)  =  f'(x)  -  /a,    a<x<b,  (2.3.18) 

and  (2.3.17)  implies  that 

g'(a)  <  0    and    g'(b)  >  0.  (2.3.19) 

Since  g  is  continuous  on  [a,  b],  g  attains  a  minimum  at  some  point  c  in  [a,  b].  Lemma  2.3.2 
and  (2.3.19)  imply  that  there  is  a  S  >  0  such  that 


g(x)  <  g(a),    a  <  x  <  a  +  8,    and    g(x)  <  g{b),    b  —  S  <  x  <  b 
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(Exercise  2.3.3),  and  therefore  c  ^  a  and  c  ^  b.  Hence,  a  <  c  <  b,  and  therefore 
g'(c)  =  0,  by  Theorem  2.3.7.  From  (2.3.18),  f'{c)  =  \l. 

The  proof  for  the  case  where  f'(b)  <  fi  <  f'(a)  can  be  obtained  by  applying  this  result 
to-/.  H 

Mean  Value  Theorems 

Theorem  2.3.10  (Generalized  Mean  Value  Theorem)  Iff  and  g  are  con- 
tinuous on  the  closed  interval  [a,  b]  and  differentiable  on  the  open  interval  {a,  b),  then 

[g(b)  -  g(a)]f'(c)  =  [f(b)  -  f(a)]g'(c)  (2.3.20) 

for  some  c  in  (a,  b). 

Proof   The  function 

h(x)  =  [g(b)  -  g(a)]f(x)  -  [f{b)  -  f(a)]g(x) 

is  continuous  on  [a,  b]  and  differentiable  on  (a,  b),  and 

h(a)  =  h{b)  =  g{b)f{a)  -  f(b)g(a). 

Therefore,  Rolle's  theorem  implies  that  h'(c)  =  0  for  some  c  in  (a,  b).  Since 

h'(c)  =  [g(b)  -  g(a)]f'(c)  -  [f{b)  -  f(a)]g'(c), 

this  implies  (2.3.20).  TJ 
The  following  special  case  of  Theorem  2.3.10  is  important  enough  to  be  stated  separately. 

Theorem  2.3.11  (Mean  Value  Theorem)      /  is  continuous  on  the  closed 
interval  [a,  b]  and  differentiable  on  the  open  interval  {a,  b),  then 

_  m-m 

b  —  a 

for  some  c  in  (a,  b). 

Proof   Apply  Theorem  2.3.10  with  g(x)  =  x. 

TJ 

Theorem  2.3. 1 1  implies  that  the  tangent  to  the  curve  y  =  f  (x)  at  (c,  / (c))  is  parallel  to 
the  line  connecting  the  points  (a,  / (a))  and  (b,  f  (b))  on  the  curve  (Figure  2.3.5,  page  84). 

Consequences  of  the  Mean  Value  Theorem 

If  /  is  differentiable  on  (a,b)  and  x\,  Xj  e  (a,b)  then  /  is  continuous  on  the  closed 
interval  with  endpoints  x\  and  x-i  and  differentiable  on  its  interior.  Hence,  the  mean  value 
theorem  implies  that 

f(x2)  -  f(x{)  =  f'(c)(x2  -  Xl) 

for  some  c  between  x\  and  x2.  (This  is  true  whether  X\  <  Xz  or  x%  <  X\.)  The  next  three 
theorems  follow  from  this. 
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Theorem  2.3.12  //  f'Qc)  —  0  for  all  x  in  (a,b),  then  f  is  constant  on  (a,b). 

Theorem  2.3.13  If  f  exists  and  does  not  change  sign  on  (a,  b),  then  f  is  monotonic 
on  (a,b)  :  increasing,  nondecreasing,  decreasing,  or  nonincreasing  as 

f'(x)>0,     /'(*)>  0,     /'(*)<  0,    or    /'(*)<  0, 

respectively,  for  all  x  in  (a,b). 

Theorem  2.3.14  // 

\f'(x)\<M,  a<x<b, 

then 

\f{x)-f{x')\  <M\x-x'\,    x,x'&(a,b).  (2.3.21) 

A  function  that  satisfies  an  inequality  like  (2.3.21)  for  all  x  and  x'  in  an  interval  is  said 
to  satisfy  a  Lipschitz  condition  on  the  interval. 


y 


Figure  2.3.5 
2.3  Exercises 


1.    Prove  that  a  function  /  is  differentiable  at  xq  if  and  only  if 

,.      f  (x)  -  f  (x0)  -  m(x  -  x0) 
hm    =  0 

x^x0  X  —  Xq 

for  some  constant  m.  In  this  case,  f'(xo)  =  m. 
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2.  Prove:  If  /  is  defined  on  a  neighborhood  of  Xq,  then  /  is  differentiable  at  .To  if  and 
only  if  the  discontinuity  of 

h(x)  =   

X  —  Xq 

at  xq  is  removable. 

3.  Use  Lemma  2.3.2  to  prove  that  if  f'(xo)  >  0,  there  is  a  S  >  0  such  that 

/ (x)  <  f  (xq)  if  Xq  —  S  <  X  <  Xq  and  / (x)  >  f  (Xo)  if  Xq  <  X  <  Xq  +  S. 

4.  Suppose  that  p  is  continuous  on  {a,  c]  and  differentiable  on  (a,  c),  while  q  is  con- 
tinuous on  [c,  b)  and  differentiable  on  (c,  b).  Let 


/(*)  = 

(a)   Show  that 


p(x),  a  <  x  <  c, 
^r(x),    c  <  x  <  b. 


fix)  = 


[  p'{x),  a  <  x  <  c, 
q'(x),    c  <  x  <  b. 


(b)    Under  what  additional  conditions  on  p  and  q  does  f'(c)  exist?  Prove  that 
your  stated  conditions  are  necessary  and  sufficient. 

5.  Find  all  derivatives  of  / (x)  =  xn~x \x\,  where  n  is  a  positive  integer. 

6.  Suppose  that  f'(0)  exists  and  f(x  +  y)  =  f(x)f  (y)  for  all  x  and  y.  Prove  that  /' 
exists  for  all  x. 

7.  Suppose  that  c'(0)  =  a  and  s'(0)  =  b  where  a2  +  b2  ^  0,  and 

c(x  +  y)  =  c(x)c(y)  -  s(x)s(y) 
s(x  +  y)  =  s(x)c(y)  +  c(x)s(y) 

for  all  x  and  y . 

(a)  Show  that  c  and  s  are  differentiable  on  (— oo,  oo),  and  find  c'  and  s'  in  terms 
of  c,  s,  a,  and  b. 

(b)  (For  those  who  have  studied  differential  equations.)  Find  c  and  s  explicitly. 

8.  (a)    Suppose  that  /  and  g  are  differentiable  at  xq,  /(xq)  =  g(xo)  =  0,  and 

g'(xo)  7^  0.  Without  using  LHospital's  rule,  show  that 

f(x)  f'(x0) 
hm    =  —  . 

x-+xo  g{x)  g'(x0) 

(b)    State  the  corresponding  results  for  one-sided  limits. 

9.  Prove  Theorem  2.3.4(a). 
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10.  Prove  Theorem  2.3.4(b). 

11.  Prove  Theorem  2.3.4(d). 

12.  Prove  by  induction:  If  n  >  1  and       (xq)  and  g'"^  (xq)  exist,  then  so  does  (fg)^  (xq), 
and 

(fg)W(xo)  =  £  (n)f{m\xo)g{n-m)(x0). 

m=0  V  / 

Hint:  See  Exercise  1.2.19.  This  is  Leibniz's  rule  for  differentiating  a  product. 

13.  What  is  wrong  with  the  "proof"  of  the  chain  rule  suggested  after  Example  2.3.3? 
Correct  it. 

14.  Suppose  that  /  is  continuous  and  increasing  on  [a,  b].  Let  /  be  differentiable  at  a 
point  xq  in  (a,  b),  with  f'(xo)  ^  0.  If  g  is  the  inverse  of  /  Theorem  2.2.15),  show 
thatg'(/(x0))  =  l//'(x0). 

15.  (a)    Show  that  f\  (a)  =  f'(a+)  if  both  quantities  exist. 

(b)  Example  2.3.4  shows  that  fl_(a)  may  exist  even  if  f'(a+)  does  not.  Give  an 
example  where  f'{a+)  exists  but  fL{a)  does  not. 

(c)  Complete  the  following  statement  so  it  becomes  a  theorem,  and  prove  the 
theorem:  "If  f'(a+)  exists  and  /  is  at  a,  then  fL{a)  =  f'(a+)." 

16.  Show  that  f{a+)  and  f(b-)  exist  (finite)  if  /'  is  bounded  on  (a,b).  Hint:  See 
Exercise  2.1.38. 

17.  Suppose  that  /  is  continuous  on  [a,  b],  f\_{a)  exists,  and  [i  is  between  fl_(a)  and 
(/ (b)  —  f  (a))/(b  —  a).  Show  that  /(c)  —  / (a)  =  n(c  —  a)  for  some  c  in  (a,  b). 

18.  Suppose  that  /  is  continuous  on  [a,  b],  f+(a)  <  [i  <  f!_(b),  and 

(f(b)-f(a))/(b-  a) 

Show  that  either  /(c)  —  / (a)  =  fi(c  —  a)  or  / (c)  —  f(b)  =  yt(c  —  b)  for  some  c 
in  {a,  b). 

19.  Let 

sin  x 

f(x)  =  ,  x^O. 

X 

(a)  Define  /(0)  so  that  /  is  continuous  at  x  =  0.  Hint:  Use  Exercise  2.3.8. 

(b)  Show  that  if  X  is  a  local  extreme  point  of  /,  then 

\m\  =  a  +  Y2r1/2. 

HINT:  Express  sinx  and  cosx  in  terms  of  f(x)  and  fix),  and  add  their 
squares  to  obtain  a  useful  identity. 

(c)  Show  that  |  / (x)\  <  1  for  all  x.  For  what  value  of  x  is  equality  attained? 
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20.     Let  n  be  a  positive  integer  and 


sin«.r 


/(*)  = 


x  7^  kit    (k  =  integer). 


n  sin  x 


(a) 
(b) 


Define  / (kn)  so  that  /  is  continuous  at  kit .  Hint:  Use  Exercise  2.3.8. 
Show  that  if  X  is  a  local  extreme  point  of  /,  then 


\f{x)\  =  [l  +  (»2-l)sin2x 


HINT:  Express  sinnx  and  cosnx  in  terms  of  fix)  and  f'(x),  and  add  their 
squares  to  obtain  a  useful  identity. 

(c)    Show  that  |  / (x)\  <  1  for  all  x.  For  what  values  of  x  is  equality  attained? 

21.     We  say  that  /  has  at  least  n  zeros,  counting  multiplicities,  on  an  interval  /  if  there 
are  distinct  points  x\,  Xi, . . . ,  xp  in  /  such  that 


and  n\  +  ■  ■  ■  +  np  =  n.  Prove:  If  /  is  differentiable  and  has  at  least  n  zeros, 
counting  multiplicities,  on  an  interval  /,  then  /'  has  at  least  n  —  1  zeros,  counting 
multiplicities,  on  / . 

22.  Give  an  example  of  a  function  /  such  that  /'  exists  on  an  interval  (a,  b)  and  has  a 
jump  discontinuity  at  a  point  xq  in  (a,  b),  or  show  that  there  is  no  such  function. 

23.  Let  x\,  X2,  ■  ■  ■ ,  xn  and  y\,  yi,       yn  be  in  {a,  b)  and  yi  <  x, ,  1  <  i  <  n.  Show 
that  if  /  is  differentiable  on  (a,  b),  then 


for  some  c  in  (a,  b). 

24.     Prove  or  give  a  counterexample:  If  /  is  differentiable  on  a  neighborhood  of  xq,  then 
/  satisfies  a  Lipschitz  condition  on  some  neighborhood  of  xq. 


25.  Let 

/"(*)  +  p{x)f{x)  =  0    and    g"(x)  +  p(x)g(x)  =  0,    a  <  x  <  b. 


/t/)(Xi)  =  0,    0<y  <m-\,  l<i<p, 


n  n 


£  [f(xi)  -  f(yt)]  =  /'(c)  £  (x,  -  y,) 


(a)  Show  that  W  =  f'g  —  fg'  is  constant  on  (a,  b). 

(b)  Prove:  If  W  ^  0  and  f(xi)  =  f(x2)  =  0  where  a  <  x\  <  x2  <  b,  then 
g(c)  =  0  for  some  c  in  (xi,  X2).  Hint:  Consider  f  / g. 
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26.     Suppose  that  we  extend  the  definition  of  differentiability  by  saying  that  /  is  differ- 
entiable  at  xq  if 

J  (xq)  =  hm   

x-+x0        X  —  Xq 

exists  in  the  extended  reals.  Show  that  if 


/(*)  = 


\  <fx,  x  >  0, 
'  —J—x,    x  <  0, 


then  f'(0)  =  oo. 

27.  Prove  or  give  a  counterexample:  If  /  is  differentiable  at  xq  in  the  extended  sense  of 
Exercise  2.3.26,  then  /  is  continuous  at  xq. 

28.  Assume  that  /  is  differentiable  on  (— oo,  oo)  and  xq  is  a  critical  point  of  /. 

(a)  Leth(x)  =  f(x)g(x),  where  g  is  differentiable  on  (— oo,  oo)  and 

f(x0)g'(x0)  ?  0. 

Show  that  the  tangent  line  to  the  curve  y  =  h(x)  at  (xq,  h(xo))  and  the  tangent 
line  to  the  curve  y  =  g(x)  at  (xq,  g(xo)  intersect  on  the  x-axis. 

(b)  Suppose  that  f(xo)  ^  0.  Let  h(x)  =  f(x)(x  —  x\),  where  x\  is  arbitrary. 
Show  that  the  tangent  line  to  the  curve  y  =  h(x)  at  (xo,  h(xo))  intersects  the 
.T-axis  at  ~x  =  x\. 

(c)  Suppose  that  f(xo)  ^  0.  Let  h(x)  =  f(x)(x  —  x\)2,  where  x\  ^  xq.  Show 
that  the  tangent  line  to  the  curve  y  =  h(x)  at  (^o,  ^(^o))  intersects  the  x-axis 
at  the  midpoint  of  the  interval  with  endpoints  xq  and  x\ . 

(d)  Let  h(x)  =  (ax2  +  bx  +  c)(x  —  x\),  where  a  ^  0  and  b2  —  4ac  ^  0.  Let 

xo  =  •  Show  that  the  tangent  line  to  the  curve  y  =  h(x)  at  (xq,  h(xo)) 

2a 

intersects  the  x-axis  at  x  =  x\. 

(e)  Let  h  be  a  cubic  polynomial  with  zeros  a,  ft,  and  y,  where  a  and  ft  are  distinct 

a  +  P 

and  y  is  real.  Let  xq  =  — - — .  Show  that  the  tangent  line  to  the  curve 
y  =  h(x)  at  (xq,  h(xo))  intersects  the  axis  at  X  =  y. 


2.4  L'HOSPITAL'S  RULE 


The  method  of  Theorem  2.1.4  for  finding  limits  of  the  sum,  difference,  product,  and  quo- 
tient of  functions  breaks  down  in  connection  with  indeterminate  forms.  The  generalized 
mean  value  theorem  (Theorem  2.3.10)  leads  to  a  method  for  evaluating  limits  of  indetermi- 
nate forms. 

Theorem  2.4.1  (L'Hospital's  Rule)  Suppose  that  f  and  g  are  differentiable 
and  g'  has  no  zeros  on  (a,b).  Let 

lim  f(x)  =   lim  g(x)  =  0  (2.4.1) 

x—>b—  x—>b— 
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and  suppose  that 


Then 


lim  f{x)  =  ±00    and      lim  g(x)  =  ±00, 

x— >b—  x—*-b— 


lira  ^-Jr—  =  L    (finite  or  ±00). 


lim  — — -  =  L. 


(2.4.2) 


(2.4.3) 


(2.4.4) 


Proof  We  prove  the  theorem  for  finite  L  and  leave  the  case  where  L  =  ±00  to  you 
(Exercise  2.4.1). 

Suppose  that  e  >  0.  From  (2.4.3),  there  is  an  xq  in  (a,  b)  such  that 


/'(c) 


<  €     if     Xq  <  C  <  b. 


(2.4.5) 


Theorem  2.3.10  implies  that  if  x  and  f  are  in  [xq,  b),  then  there  is  a  c  between  them,  and 
therefore  in  (xo,  b),  such  that 

lg  (x)  -  g{t)]f'{c)  =  [f{x)  -  fit)]g'ic).  (2.4.6) 

Since  g'  has  no  zeros  in  (a,  b),  Theorem  2.3.1 1  implies  that 

gix)  -  git)  ^0    if  x,teia,b). 

This  means  that  g  cannot  have  more  than  one  zero  in  (a,b).  Therefore,  we  can  choose  xq 
so  that,  in  addition  to  (2.4.5),  g  has  no  zeros  in  [xq,  b).  Then  (2.4.6)  can  be  rewritten  as 


/(*)  ~  fit)  _  f'ic) 
gix)  -  git)       g'ic)  ' 


so  (2.4.5)  implies  that 


fix)  -  fiO 


-L 


<  e    if   x,  t  e  [xq,  b). 


gix)  -  git) 

If  (2.4.1)  holds,  let  x  be  fixed  in  [xq,  b),  and  consider  the  function 

G(t)=f^-m-L. 


(2.4.7) 


From  (2.4.1), 


so 


gix)  ~  git) 


lim  fit)  =  lim  git)  =  0, 

t^fb-  t^b- 


lim  Git)  =  i^l-L. 
t-*b-  gix) 


(2.4.8) 
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Since 

\G(t)\  <  e    if   x0  <  f  < 
because  of  (2.4.7),  (2.4.8)  implies  that 

m 


<  6. 


This  holds  for  all  x  in  (xq,  b),  which  implies  (2.4.4). 

The  proof  under  assumption  (2.4.2)  is  more  complicated.  Again  choose  xq  so  that  (2.4.5) 
holds  and  g  has  no  zeros  in  [xq,  b).  Letting  t  =  xq  in  (2.4.7),  we  see  that 

f(x)  -  f(x0) 


g(x)  -  g(x0) 


-L 


<  e    if    xq  <  x  <  b. 


(2.4.9) 


Since  limx^6_  / (x)  =  ±oo,  we  can  choose  x\  >  xq  so  that  / (x)  ^  0  and  / (x)  ^  / (xo) 
if  x\  <  x  <  b.  Then  the  function 


u{x) 


I  -  g(x0)/g(x) 


1  -  f(x0)/f(x) 
is  defined  and  nonzero  if  x\  <  x  <  b,  and 


lim  u(x)  =  1, 

x— >b— 


(2.4.10) 


because  of  (2.4.2). 
Since 


f(x)-f(x0)      f{x)  1 -/(*„)//(*) 


-  g(x0)  1  -  g(x0)/g(x)  g(x)u(x) 

(2.4.9)  implies  that 

m 


g(x)u(x) 


-L 


<  e    if    x\  <  x  < 


which  can  be  rewritten  as 

m 


g{x) 


—  Lu(x) 


<  e\u(x)\    if    x\  <  x  <  b. 


(2.4.11) 


From  this  and  the  triangle  inequality, 


/(*)  J 

< 

fix) 

g(x) 

g(x) 

Lu{x) 


+  \Lu(x)  -L\<  e\u(x)\  +  \L\  \u(x)  -  1|.  (2.4.12) 


Because  of  (2.4. 10),  there  is  a  point  Xi  in  (x\ ,  b)  such  that 

\u(x)  —  1|  <  e    and  therefore     \u(x)\  <  1  +  e    if    Xi  <  x  <  b. 
This,  (2.4.1 1),  and  (2.4.12)  imply  that 
fix) 


gix) 


-L 


<e(l+e)+|L|e    if   x2  <  x  <  b. 
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which  proves  (2.4.4)  under  assumption  (2.4.2).  H 

Theorem  2.4.1  and  the  proof  given  here  remain  valid  if  b  =  oo  and  "x  — ►  b—"  is 
replaced  by  "x  — ►  oo"  throughout.  Only  minor  changes  in  the  proof  are  required  to  show 
that  similar  theorems  are  valid  for  limits  from  the  right,  limits  at  — oo,  and  ordinary  (two- 
sided)  limits.  We  will  take  these  as  given. 

The  Indeterminate  Forms  0/0  and  oo/oo 
We  say  that  f/g  is  of  the  form  0/0  as  x  — >  b—  if 

lim  f(x)  =   lim  g(x)  =  0, 

x-+b—  x-^b— 


or  of  the  form  oo/oo  as  x  — >  b—  if 


and 


lim  f(x)  =  ±oo 

x— >b— 


lim  g(x)  =  ±oo. 

x— *b— 


The  corresponding  definitions  for  x  — >  b+  and  x  — >•  ±oo  are  similar.  If  f/g  is  of  one  of 
these  forms  as  x  — >  b—  and  as  x  — >  b+,  then  we  say  that  it  is  of  that  form  as  x  — >•  b. 

Example  2.4.1  The  ratio  sinx/x  is  of  the  form  0/0  as  x  — ►  0,  and  L'Hospital's  rule 
yields 

sinx  cosx 
lim  =  lim   =  1. 

x~*0     x  x->0  1 

Example  2.4.2  The  ratio  e~x  jx  is  of  the  form  oo/oo  as  x  — ►  — oo,  and  L'Hospital's 
rule  yields 

e_x               — e_x 
lim    =    lim   =  — oo. 

X— >~ oo    x  x— >  —  00  1 

Example  2.4.3  Using  L'Hospital's  rule  may  lead  to  another  indeterminate  form;  thus, 

ex  ex 
lim  —  =  lim  — 

x-^oo  xz       x-*oo  2x 

if  the  limit  on  the  right  exists  in  the  extended  reals.  Applying  L'Hospital's  rule  again  yields 

lim  —  =  lim  —  =  oo. 

x->oo  2x       x^tx  2 


Therefore, 
More  generally, 
for  any  real  number  a  (Exercise  2.4.33) 


lim  —  =  oo. 


lim  —  =  oo 

x— »oo  xa 
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Example  2.4.4  Sometimes  it  pays  to  combine  L'Hospital's  rule  with  other  manipula- 
tions. For  example, 

4  —  4  cos  x  —  2  sin2  x  4  sin  x  —  4  sin  x  cos  x 

lim  =  lim  z  

4x3 

sinx\  (       1  —  cosx 
lim          I  I  lim 


x~*0     X     I  \x^0  X 


-I 

(       sinx\  /  sinx\ 
=    lim   lim  

\x^-0     X    )  \x^0    2x  ) 

If  sinx\ 
=  _  lim  

2  \x^o    x  J 
=  \^)2  =  \    (Example  2.4.1). 

As  another  example,  L'Hospital's  rule  yields 

e'*2  log(l  +  x)           -2xe~x2  log(l  +  x)  +  e~x2(l  +  x)~l 
lim   =  lim   =  1. 

x->0  X  x-*0  1 

However,  it  is  better  to  remove  the  "determinate"  part  of  the  ratio  before  using  L'Hospital's 
rule: 


lim  £!!Mi±^)  =  ( lim  e-A  ( lim  M+fO) 

x^O  X  \x^0  J  \x^0  X  J 


=  (1)  lim  l0g(1+X) 

x^O  X 

=  lim   =  1.  ■ 

x~^o  1 

In  using  L'Hospital's  rule  we  usually  write,  for  example, 

fix)  fix) 
lim  J-)-L  =  lim  J-^-l  (2.4.13) 

x^b  g(x)        x^b  g'(x) 

and  then  try  to  find  the  limit  on  the  right.  This  is  convenient,  but  technically  incorrect,  since 
(2.4.13)  is  true  only  if  the  limit  on  the  right  exists  in  the  extended  reals.  It  may  happen  that 
the  limit  on  the  left  exists  but  the  one  on  the  right  does  not.  In  this  case,  (2.4.13)  is  incorrect. 

Example  2.4.5  If 

7  1 

f(x)  =  x—  x  sin—    and    g(x)  =  sinx, 
x 

then 

f'(x)  =  1  —  2x  sin  — h  cos  —    and    g'(x)  =  cos  x. 
x  x 
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Therefore,  limx^o  f'(x)/g'(x)  does  not  exist.  However, 

fix)            l-xsin(l/x)  1 
lim   =  lim   =  -  =  1. 

x^-o  gix)      x^o     (sinx)/x  1 


The  Indeterminate  Form  0 


oo 


We  say  that  a  product  fg  is  of  the  form  0  ■  oo  as  x  — >  b—  if  one  of  the  factors  approaches 
0  and  the  other  approaches  ±oo  as  x  — >  b— .  In  this  case,  it  may  be  useful  to  apply 
L'Hospital's  rule  after  writing 

/(x)g(x)  =  -M-    or    fix)gix)=  !<iX) 


since  one  of  these  ratios  is  of  the  form  0/0  and  the  other  is  of  the  form  oo/oo  asi->  b— . 
Similar  statements  apply  to  limits  as  x  —s-  b+,  x  — >  b,  and  x  — >  ±oo. 

Example  2.4.6  The  product  x  log  x  is  of  the  form  0  •  oo  as  x  — >  0+.  Converting  it  to 
an  oo/oo  form  yields 

i-      i  r  los* 

lim  *  log  x  =   lim  — - — 

x^0+  x^0+  l/X 

=    lim   —^r 

x^0+  -l/X2 

=  —  lim  x  =  0. 

Converting  to  a  0/0  form  leads  to  a  more  complicated  problem: 

x 


lim  x  log  x  =  lim 

x^0+  x^0+  1/  log* 

1 

=  lim 


x^0+  -l/x(logx)2 


=  —  lim  x(logx)   =  ? 

x^0  + 


Example  2.4.7  The  products  log(l+  l/x)  is  of  the  form0-oo  as  x  ->•  oo.  Converting 
it  to  a  0/0  form  yields 

r  i  n  ^1/  ^  r  log(1  +  V*) 
lim  xlog(l  +  l/x)  =  lim   —  

x-+oo  x^oo  l/x 

[1/(1  +  l/X)]  (-1/X2) 

=  lim   — —  

x^oo  —l/x2 
1 

=  lim   —  =  1. 

x-*oo  1  +  l/x 
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In  this  case,  converting  to  an  oo/oo  form  complicates  the  problem: 

X 

lim  xlog(l  + 1/x) =  lim  TTTTT^ 

x->oo  x->oo  l/log(l  +  1/x) 

1 

=  lim 

x->oo 


([log(l  +  l/x)p)G  +  l/J 
=  lim  x(x  +  l)[log(l  +  1/x)]2  =  ? 


The  Indeterminate  Form  oo  —  oo 

A  difference  f  —  g  is  of  the  form  oo  —  oo  as  x  — >  6—  if 

lim  /(x)  =   lim  g(x)  =  ±oo. 

x^>&-  x^fc- 

In  this  case,  it  may  be  possible  to  manipulate  f  —  g  into  an  expression  that  is  no  longer 
indeterminate,  or  is  of  the  form  0/0  or  oo/oo  as  x  — >  b—.  Similar  remarks  apply  to  limits 
asi->  b+,  x  — >  Z?,  or  x  — >  ±oo. 

Example  2.4.8  The  difference 

sinx  1 


x^  x 

is  of  the  form  oo  —  oo  as  x  — >  0,  but  it  can  be  rewritten  as  the  0/0  form 

sinx  —  x 


Hence, 


/sinx  1\  sinx  —  x  cosx  — 1 
lim  — z  =  lim  -       =  lim  

x^o  \  xA       x  J     x^o      xA  x^o  2x 

—  sinx 
hm  =  0. 

x-i-o  2 


x2  —  x 


Example  2.4.9  The  difference 
is  of  the  form  oo  —  oo  as  x  — >  oo.  Rewriting  it  as 


x2 


which  is  no  longer  indeterminate  asu-oo,  we  find  that 
lim  (x2  —  x)  =  lim  x2 


x->oo  x~*oo 


K) 


(  lim  x2)  lim  (  1  -  -  ) 

XX— >oo      /  x— >oo  V         x  I 


=  (oo)(l)  =  OO 
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The  Indeterminate  Forms  0°,  1°°,  and  oo° 
The  function  fs  is  defined  by 

f(x)gM  =  egMlosf(x)  =  exp(g(x)log/(x)) 

for  all  x  such  that  /(x)  >  0.  Therefore,  if  /  and  g  are  defined  and  f(x)  >  0  on  an  interval 
(a,  b),  Exercise  2.2.22  implies  that 

lim  [f(x)]gM  =  exp  (  lim  g(x)  log  f(x))  (2.4.14) 

x— *b—  \x—>b—  ) 

if  limx^_  g{x)  log  / (x)  exists  in  the  extended  reals.  (If  this  limit  is  ±oo  then  (2.4.14)  is 
valid  if  we  define  e~°°  =  0  and  e°°  =  oo.)  The  product  g  log  /  can  be  of  the  form  0  ■  oo 
in  three  ways  as  x  — ►  b— : 

(a)  If  limx^6_  g(x)  =  0  and  limx^6_  f(x)  =  0. 

(b)  If  lim*-^-  g(x)  =  ±oo  and  limx^6_  f(x)  =  1. 

(c)  If  limx_*h_  g(x)  =  0  and  limx_+h_  /(x)  =  oo. 

In  these  three  cases,  we  say  that  f8  is  of  the  form  0°,  1°°,  and  oo°,  respectively,  as  x  — >■ 
Z>— .  Similar  definitions  apply  to  limits  as  x  — ►  6+,  x      b,  and  x  — >  ±oo. 

Example  2.4.10  The  function  xx  is  of  the  form  0°  as  x  ->•  0+.  Since 

_  ex\ogx 

and  limx_>o+  X  logx  =  0  (Example  2.4.6), 

lim  xx  =  e°  =  1. 

x^-0  + 

Example  2.4.11  The  function  x1^*-^  is  of  the  form  l°°  as  x  ->■  1.  Since 
and 

log  x  1/x 
lim   =  lim         =  1, 

x~*\  X  —  1       *-M  1 

it  follows  that 

limx1^-1>=e1=e. 
Example  2.4.12  The  function  x1^  is  of  the  form  oo°  as  x  — >•  oo.  Since 


and 
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it  follows  that 


lim  x1/x  =  e°  =  1. 

x— >oo 

2.4  Exercises 


1.    Prove  Theorem  2.4.1  for  the  case  where  limx^b-  f'(x)/s'(x)  ~  ±oo. 
In  Exercises  2.4.2-2.4.40,  find  the  indicated  limits. 

1  -  cos  x  1  +  COS  X 


2.  lim 


tan  1  x 


3.  lim 


sin  1  x 


5.  lim 


smnx 


6.  lim 


log(l  +  x2) 
log(l  +  x) 


4.  lim 


x^tt  sin  x 


x-s-0  x 


x-»-o+    ex  —  1 


7.      lim  e*  sine  * 

x— >oo 


8.      lim  jcsin(l/x)         9.      lim  VT(e  1/x  —  1)    10.      lim  tanxlogx 

x-^oo  x^-oo  x^0  + 


11.     lim  sinx  log(|  tanx|) 

X— >K 

13.  lim(VTTT-V^) 


15.     lim  (cot  x  —  esc  x) 

x->0 


17.     lim  |sinx|tanx 

X— >JT 


19.     lim  I  sinxl* 

x^O 

21.      lim  jcsin(1/x) 


23.      lim  xa\ogx 

x^-0+ 


25. 


lim  (±±T) 

x^oo  y  x  —  1  / 


Vx^i 


27.  lim 


(log  x) 


x^-oo  x 

29.      lim  (jca-logjc) 


12.  lim 


26. 


— h  log(tan  jc) 


14.  Itmf^-i) 

x->o       —  1     x J 

16.     lim  f  J—  I) 

x^o  \sinx  x/ 


18.       lim  |tanx|cos* 

x— >?r/2 


20.  lim(l+jc) 

x^O 


1/x 


22.  lim 


4- 


24.  lim 


COS  X  X 

log(logx) 


x^e  sin(jc  —  e) 


lim  f£±T) 

x^l  +  \X  -1  J 


4x^1 


28.      lim  (cosh*  —  sinhx) 

x— >oo 

30.       lim  ex2sin(ex) 
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sin x  —  x  ~\~ 

31.      lim x(x  +  l)[log(l  +  l/x)]2     32.  lim- 


....       ..     ex  34.        .....  , 

33.      lim  —  x^3tt/2- 

35.      lim  (logx)"log(logx)  36.      lim  — ^  

37.  lim  (sinx)mnx 

"Vr! 

38.  lim   (n  =  integer  >  1) 

x^O  xn  - 


I 


(2r  +  1)! 


/2r+l 

sinx  — 

r  =  () 

0-l/x2 


39.     lim  ^j-j   (n  =  integer  >  0) 


e 

40.  lim  =  0    («  =  integer) 

x^O      X"  5 

41.  (a)   Prove:  If  /  is  continuous  at  xq  and  limx^xo  f'(x)  exists,  then  f'(xo)  exists 

and  /'  is  continuous  at  xq. 
(b)    Give  an  example  to  show  that  it  is  necessary  to  assume  in  (a)  that  /  is  con- 
tinuous at  xq. 

42.  The  iterated  logarithms  are  defined  by  Lo(x)  =  x  and 

Ln{x)  =  log(L„_!(x)),    x  >  a„,    n  >  1, 

where  a\  =  0  and  an  =  ea'l~l ,  n  >  1.  Show  that 

(a)  Ln(x)  =  L„_i(logx),    x  >  a„,    n  >  1. 

(b)  L„_i(a„+)  =  0  and  Ln(a„+)  =  -oo. 

(c)  lim  (L„_  i        L„  (x)  =  0  if  a  >  0  and  n  >  1 . 

(d)  lim  (L„(x))™/L„_i(x)  =  0  if  a  is  arbitrary  and  n  >  1. 

43.  Let  /  be  positive  and  differentiable  on  (0,  oo),  and  suppose  that 

f'(x) 

lim  =  L,    where    0  <  L  <  oo. 

fix) 

Define  fo(x)  =  x  and 

/»(*)  =  / (/»-!(*)),  «>L 

Use  L'Hospital's  rule  to  show  that 

lim  w"v  77    =oo    if    a  >  0    and    n  >  1. 
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44.  Let  /  be  differentiable  on  some  deleted  neighborhood  N  of  xo,  and  suppose  that  / 
and  /'  have  no  zeros  in  N.  Find 

(a)  lim  \f(x)\f(-x)  if  lim  f(x)  =  0; 

X— >XQ  X—>XQ 

(b)  lim  if  lim  f(x)  =  1; 

X—>X()  X— >X() 

(c)  lim  if  linw0  f(x)  =  oo. 

x-rxo 

45.  Suppose  that  /  and  g  are  differentiable  and  g'  has  no  zeros  on  (a,  /?).  Suppose  also 
that  \imx^b-  f'(x)/g'(x)  =  L  and  either 

lim  f(x)  =   lim  g(x)  =  0 

x^-b—  x—*-b— 

or 

lim  /(x)  =  oo    and       lim  g(x)  =  ±oo. 

x— >b—  '  x^fb— 

Findlimx^6_(l  +  f(x))l/g{x\ 

46.  We  distinguish  between  oo  •  oo  (=  oo)  and  (— oo)oo  (=  — oo)  and  between  oo  +  oo 
(=  oo)  and  — oo  —  oo  (=  — oo).  Why  don't  we  distinguish  between  0  ■  oo  and 
0  ■  (— oo),  oo  —  oo  and  — oo  +  oo,  oo/oo  and  — oo/oo,  and  1°°  and 

2.5  TAYLOR'S  THEOREM 

A  polynomial  is  a  function  of  the  form 

p{x)  =  ao  +  a\{x  -  xo)  H  \-  a„(x  -  x0)",  (2.5.1) 

where  ao,  . . . ,  an  and  xq  are  constants.  Since  it  is  easy  to  calculate  the  values  of  a  polyno- 
mial, considerable  effort  has  been  devoted  to  using  them  to  approximate  more  complicated 
functions.  Taylor's  theorem  is  one  of  the  oldest  and  most  important  results  on  this  question. 

The  polynomial  (2.5.1)  is  said  to  be  written  in  powers  of  x  —  xq,  and  is  of  degree  n  if 
an  ^  0.  If  we  wish  to  leave  open  the  possibility  that  an  =  0,  we  say  that  p  is  of  degree 

<  n.  In  particular,  a  constant  polynomial  p(x)  =  ao  is  of  degree  zero  if  ao  ^  0.  If 
ao  =  0,  so  that  p  vanishes  identically,  then  p  has  no  degree  according  to  our  definition, 
which  requires  at  least  one  coefficient  to  be  nonzero.  For  convenience  we  say  that  the 
identically  zero  polynomial  p  has  degree  — oo.  (Any  negative  number  would  do  as  well  as 
— oo.  The  point  is  that  with  this  convention,  the  statement  that  p  is  a  polynomial  of  degree 

<  n  includes  the  possibility  that  p  is  identically  zero.) 

Taylor  Polynomials 

We  saw  in  Lemma  2.3.2  that  if  /  is  differentiable  at  xq,  then 

f(x)  =  f{x0)  +  f'(x0)(x  -  x0)  +  E(x)(x  -  x0), 
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where 


lim  E(x)  =  0. 

X  — >XQ 


To  generalize  this  result,  we  first  restate  it:  the  polynomial 

^(x)  =  f(x0)  +  f'(x0)(x  -Xo), 
which  is  of  degree  <  1  and  satisfies 

7\(x0)  =  /(x0),    Tfeo)  =  /'(xo), 
approximates  /  so  well  near  Xo  that 

lim  l±-L  lJ_L  =  o.  (2.5.2) 

x^x0        X  —  Xo 

Now  suppose  that  /  has  n  derivatives  at  xq  and  Tn  is  the  polynomial  of  degree  <  n 
such  that 

T^\x0)  =  f(r\x0),    0<r<n.  (2.5.3) 

How  well  does  Tn  approximate  /  near  .To  ? 

To  answer  this  question,  we  must  first  find  Tn.  Since  Tn  is  a  polynomial  of  degree  <  n, 
it  can  be  written  as 

Tn(x)  =  ct0  +  ai(x  -  x0)  H  h  a„(x  -  x0)n,  (2.5.4) 

where  ao,  ■  ■  ■ ,  an  are  constants.  Differentiating  (2.5.4)  yields 

T^\xo)  =  r\ar,  0<r<n, 
so  (2.5.3)  determines  ar  uniquely  as 

f(r\xQ) 


0  <  r  <  n. 


r\ 

Therefore, 


Tn(x)  =  /(x0)  H  - — (x  —  xo)  H  1  ;  (x  -  xo) 

1!  n\ 

^/(r)(*o),  v 
=  1^ — h — (x-x0)  . 

r=0 

We  call  Tn  the  nth  Taylor  polynomial  of  f  about  Xq. 

The  following  theorem  describes  how  Tn  approximates  /  near  xo. 

Theorem  2.5.1  If  f*-n'(xo)  exists  for  some  integer  n  >  1  and  T„  is  the  nth  Taylor 
polynomial  of  f  about  Xq,  then 

f(x)-Tn(x) 

lim  l\L  =  o.  (2.5.5) 

x->-xq      (X  —  Xq)" 
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Proof  The  proof  is  by  induction.  Let  Pn  be  the  assertion  of  the  theorem.  From  (2.5.2) 
we  know  that  (2.5.5)  is  true  if  n  =  1;  that  is,  Pi  is  true.  Now  suppose  that  P„  is  true  for 
some  integer  n  >  1,  and  exists.  Since  the  ratio 

/(■*)  -  Tn+i{x) 
ix-x0)n+1 

is  indeterminate  of  the  form  0/0  as  x  — >  xq,  L'Hospital's  rule  implies  that 
nm  fix)-Tn+iix)  =  _J_  nm  f>ix)-TUl(x) 

x^x0      (x  —  Xo)"+1  n  +  1  x^x0         (x  —  Xq)" 

if  the  limit  on  the  right  exists.  But  /'  has  an  nth  derivative  at  xq,  and 
Tn+Xix)  =  2^  -j  ix-xo) 

r=0 

is  the  «th  Taylor  polynomial  of  /'  about  *o-  Therefore,  the  induction  assumption,  applied 
to  /',  implies  that 

f'ix)-T'(x) 
lim  —  "+1       =  0. 

x^x0         (x  —  X0)n 


This  and  (2.5.6)  imply  that 


,.  fix)-Tn+iix) 

lim  —  — - —  =  0, 

x^x0      (x  —  Xo)"  + 


which  completes  the  induction.  TJ 
It  can  be  shown  (Exercise  2.5.8)  that  if 

Pn  =  ao  +  aiix  —xo)  H  \-  a„ix  -  x0)n 

is  a  polynomial  of  degree  <  n  such  that 

,.  fix)~Pnix)  _ 
lim  —   =  0, 

x^x0      (x  —  Xq)" 

then 

f{r)ixo) 

ar  =   ; — ; 

r! 

that  is,  pn  =  Tn.  Thus,  Tn  is  the  only  polynomial  of  degree  <  n  that  approximates  /  near 
xq  in  the  manner  indicated  in  (2.5.5). 

Theorem  2.5.1  can  be  restated  as  a  generalization  of  Lemma  2.3.2. 
Lemma  2.5.2  If  f^nHx0)  exists,  then 

=  E      r,      (x-xo)r  +  E„(x)(x-x0)n,  (2.5.7) 


where 


lim  En(x)  =  E„(x0)  =  0. 

X— >XQ 
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Proof  Define 

f(x)-Tn(x) 


X  €  Df  -  {x0}, 


E„(x)  =  i  (x-x0)» 

1 0,  x  =  xo- 

Then  (2.5.5)  implies  that  limx^xo  En(x)  =  En  (xq)  =  0,  and  it  is  straightforward  to  verify 
(2.5.7).  '  U 

Example  2.5.1  If  f(x)  =  ex,  then  f{n)(x)  =  ex.  Therefore,  /(n)(0)  =  1  foxfl  >  0, 
so  the  7ith  Taylor  polynomial  of  /  about  xq  =  0  is 

"       r  2  n 

EX  XX  X 

r  1       2  n\ 

r=0 

Theorem  2.5.1  implies  that 

n  r 

lira  =2  =  0. 

x^0  x" 

(See  also  Exercise  2.4.38.) 

Example  2.5.2  If  f(x)  =  logx,  then  /(l)  =  0  and 

/«(*)  =  ^M,  r>l, 

so  the  /ith  Taylor  polynomial  of  /  about  Xq  =  1  is 

Tn(x)  =  j^t^(x-\y 

r=i 

if  7i  >  1.  (Tb  =  0.)  Theorem  2.5.1  implies  that 

n 

log^-^C-if-V^-i/ 

 r  =  l  

(JC  -  1)" 

Example  2.5.3  If  f(x)  =  (1  +  then 


lim  —   =0.    7i  >  1. 


f'(x)  =  q(l  +  x)"-1 
f"{x)  =  q(q  -  1)(1  +  xf~2 


fw(x)  =  q{q-\)-{q-n  +  1)(1  +  xf~n. 
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If  we  define 


0    =1    ^    r)  =  g(g"1)"H|g"n+1),  n>l, 


then 


/(«)(0) 


«! 


and  the  nth  Taylor  polynomial  of  /  about  0  can  be  written  as 

n     /  \ 


X  . 


r=0 


Theorem  2.5.1  implies  that 


lim  • 

x->0 


1=0 


=  0,    n>  0. 


(2.5.9) 


If  q  is  a  nonnegative  integer,  then  ( ^  ]  is  the  binomial  coefficient  defined  in  Exer- 

w 

cise  1.2.19.  In  this  case,  we  see  from  (2.5.9)  that 

Tn(x)  =  (1  +  x)q  =  f(x),  n>q. 


Applications  to  Finding  Local  Extrema 

Lemma  2.5.2  yields  the  following  theorem. 

Theorem  2.5.3  Suppose  that  f  has  n  derivatives  at  xq  and  n  is  the  smallest  positive 
integer  such  that  f^(xo)  ^  0. 

(a)  Ifn  is  odd,  Xo  is  not  a  local  extreme  point  of  f. 

(b)  Ifn  is  even,  xq  is  a  local  maximum  of  f  if  /    (xo)  <  0,  or  a  local  mininum  of  f  if 
/""(  v,,)  >  0. 

Proof   Since  f{r)(x0)  =  0  for  1  <  r  <  n  -  1,  (2.5.7)  implies  that 

/(n)(*o) 


f{x)  -  fix0) 


n\ 


+  E„ix) 


ix-xo)n 


(2.5.10) 


in  some  interval  containing  xq.  Since  \imx^.xo  En  (jc)  =  0  and  f(n)(x0)  ^  0,  there  is  a 
8  >  0  such  that 

fW(x0) 


\E„ix)\  < 


if    \x  —  xq\  <  8. 
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This  and  (2.5.10)  imply  that 

f(x)  -  f(x0) 

^-V- — (2.5.11) 

(x  -  x0y 

has  the  same  sign  as  /^"'(xo)  if  0  <  \x  —  xq\  <  S.  If  n  is  odd  the  denominator  of  (2.5.11) 
changes  sign  in  every  neighborhood  of  xq,  and  therefore  so  must  the  numerator  (since  the 
ratio  has  constant  sign  for  0  <  \x  —  xq\  <  S).  Consequently,  /(xo)  cannot  be  a  local 
extreme  value  of  /.  This  proves  (a).  If  n  is  even,  the  denominator  of  (2.5.11)  is  positive 
for  x  ^  xo,  so  / (x)  —  / (xo)  must  have  the  same  sign  as  f'n^(xo)  for  0  <  \x  —  xq\  <  8. 
This  proves  (b).  H 

For  n  =  2,(b)  is  called  the  second  derivative  test  for  local  extreme  points. 

Example  2.5.4  If  f(x)  =  e*3,then  f'(x)  =  3x2ex37  and  0  is  the  only  critical  point 
of  /.  Since 

f"{x)  =  (6x  +  9x4)ex3 

and 

f"'(x)  =  (6  +  54x3  +  21x6)ex\ 

f"(0)  =  0  and  f"'(0)  ^  0.  Therefore,  Theorem  2.5.3  implies  that  0  is  not  a  local  extreme 
point  of  /.  Since  /  is  differentiable  everywhere,  it  has  no  local  maxima  or  minima. 

Example  2.5.5  If  / (x)  =  sin*2,  then  f'(x)  =  2x  cos  x2,  so  the  critical  points  of  / 
are  0  and  ±  y/(k  +  l/2)n,  k  =  0,1,2,  Since 

f"(x)  =  2  cos  x2  -  Ax2  sin  jc2, 
/"(0)  =  2    and    /"  (±  y/(k  +  l/2)jr))  =  (-l)k+1  (4k  +  2)it. 

Therefore,  Theorem  2.5.3  implies  that  /  attains  local  minima  at  0  and  ±  ^J(k  +  \/2)n  for 
odd  integers  k,  and  local  maxima  at  ±  ^{k  +  \/2)n  for  even  integers  k. 


Taylor's  theorem 

Theorem  2.5.1  implies  that  the  error  in  approximating  f(x)  by  Tn(x)  approaches  zero 
faster  than  (x  —  xq)"  as  x  approaches  Xq,  however,  it  gives  no  estimate  of  the  error  in 
approximating  f(x)  by  Tn{x)  for  a  fixed  x.  For  instance,  it  provides  no  estimate  of  the 
error  in  the  approximation 

m  0.1  (0.1)2 

e01  %  7M0.1)  =  1  +  —  +  i— i-  =  1.105  (2.5.12) 

obtained  by  setting  n  =  2  and  x  =  0.1  in  (2.5.8).  The  following  theorem  provides  a  way 
of  estimating  errors  of  this  kind  under  the  additional  assumption  that  exists  in  a 

neighborhood  of  xq. 


104    Chapter  2  Differential  Calculus  of  Functions  of  One  Variable 


Theorem  2.5.4  (Taylor's  Theorem)  Suppose  that  f("+1'>  exists  on  an  open  in- 
terval I  about  Xq,  and  let  x  be  in  I.  Then  the  remainder 

R„(x)  =  f(x)-Tn(x) 

can  be  written  as 

f(n  +  l)(  r\ 
Rn{x)=J——±-f{x-X0)"  +  \ 

(n  +  1)! 

where  c  depends  upon  x  and  is  between  x  and  Xq. 

This  theorem  follows  from  an  extension  of  the  mean  value  theorem  that  we  will  prove 
below.  For  now,  let  us  assume  that  Theorem  2.5.4  is  correct,  and  apply  it. 

Example  2.5.6  If  f(x)  =  ex,  then  f"'{x)  =  ex,  and  Theorem  2.5.4  with  n  =  2 
implies  that 

2  c  3 

x  ex 

ex  =  1  +  x  H  1  , 

2!  3! 

where  c  is  between  0  and  x.  Hence,  from  (2.5.12), 

m  ec(0.1)3 
e01  =  1.105+  — — — , 
6 

where  0  <  c  <  0.1.  Since  0  <  ec  <  e    ,  we  know  from  this  that 

m  e01(0.1)3 

1.105  <  e01  <  1.105  +  — — . 

6 

The  second  inequality  implies  that 


go.i 


\  (Qi)3 


<  1.105. 


so 


e01  <  1.1052. 


Therefore, 

1.105  <  e01  <  1.1052, 
and  the  error  in  (2.5.12)  is  less  than  0.0002. 

Example  2.5.7  In  numerical  analysis,  fo rward  differences  are  used  to  approximate 
derivatives.  If  h  >  0,  the  first  and  second  forward  differences  with  spacing  h  are  defined 
by 

A/(jc)  =  f(x  +  h)  -  f{x) 

and 

A2f(x)  =  A  [A  f(x)]  =  A  f(x  +  h)  -  A  fix) 

=  fix  +  2h)  -  2  f{x  +  h)  +  fix).  {^-U> 

Higher  forward  differences  are  defined  inductively  (Exercise  2.5.18). 


Section  2.5  Taylor's  Theorem  105 


We  will  find  upper  bounds  for  the  magnitudes  of  the  errors  in  the  approximations 

A/(x0) 


and 


/'(*o) 


/"(*o) 


h 


A2f(x0) 


(2.5.14) 


(2.5.15) 


If  /"  exists  on  an  open  interval  containing  xq  and  xq  +  h,  we  can  use  Theorem  2.5.4  to 
estimate  the  error  in  (2.5.14)  by  writing 


f(x0  +  h)  =  f(x0)  +  f'(x0)h  + 


where  Xo  <  c  <  Xo  +  h.  We  can  rewrite  (2.5.16)  as 
f(x0  +  h)-  f(x0) 


which  is  equivalent  to 
Therefore, 


h 

A/(so) 
h 

Af(x0) 


/'(*o)  = 


f"(c)h2 


f'(c)h 


(2.5.16) 


h 


-  f'(xo)  -- 

-  f'(x0) 


f"{c)h 
2 

M2h 


where  M2  is  an  upper  bound  for  \  f"\  on  (xq,  xq  +  h). 

If  /"'  exists  on  an  open  interval  containing  xq  and  xq  +  2h,  we  can  use  Theorem  2.5.4 
to  estimate  the  error  in  (2.5.15)  by  writing 

f(x0  +  h)  =  f(x0)  +  hf'(x0)  +  ¥-f"(xo)  +  %f"\c*) 


and 


Ah  3 

f(x0  +  2h)  =  /(x0)  +  2hf'(x0)  +  2h2f'(x0)  +  —f"'(ci) 


where  xq  <  cq  <  xq  +  h  and  xq  <  c\  <  xq  +  2h.  These  two  equations  imply  that 


f(x0  +  2h)  -  2/(jc0  +  h)  +  f(x0)  =  h2f"(x0)  + 


lf"'(ci)-\fm(co) 


which  can  be  rewritten  as 

A2f(x0) 
h2 

because  of  (2.5.13).  Therefore, 


/"(xo) 


A2f(x0) 


Ii2 


f"(xo) 


5M3h 


where  M3  is  an  upper  bound  for  \f"'\  on  (xq,  xq  +  2h). 
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The  Extended  Mean  Value  Theorem 

We  now  consider  the  extended  mean  value  theorem,  which  implies  Theorem  2.5.4  (Exer- 
cise 2.5.24).  In  the  following  theorem,  a  and  b  are  the  endpoints  of  an  interval,  but  we  do 
not  assume  that  a  <  b. 

Theorem  2.5.5  (Extended  Mean  Value  Theorem)  Suppose  that  f  is  con- 
tinuous on  a  finite  closed  interval  I  with  endpoints  a  and  b  (that  is,  either  I  =  (a,b)  or 
I  =  (b,  a)),  exists  on  the  open  interval  1°,  and,  if  n  >  0,  that  f,  exist 

and  are  continuous  at  a .  Then 

/(*)  -  E  ^—r1^  ~  fl)'  =  -r- #(*  -  (2-5-17) 

^— '     r\  (n  +  1)! 

r=0 

for  some  c  in  1° . 

Proof  The  proof  is  by  induction.  The  mean  value  theorem  (Theorem  2.3.11)  implies 
the  conclusion  for  n  =  0.  Now  suppose  that  n  >  1,  and  assume  that  the  assertion  of  the 
theorem  is  true  with  n  replaced  by  n  —  1.  The  left  side  of  (2.5.17)  can  be  written  as 

f(b)-Y,—^(b-a)  =K  (2.5.18) 

for  some  number  K.  We  must  prove  that  K  =  f(-n+1\c)  for  some  c  in  1°.  To  this  end, 
consider  the  auxiliary  function 

r\  (n  +  1) 

which  satisfies 

h(a)  =  0,    h(b)  =  0, 

(the  latter  because  of  (2.5.18))  and  is  continuous  on  the  closed  interval  I  and  differentiable 
on  7°,  with 

h'(x)  =  f'(x)  -  V  I  (x  -  aY  -  K— — -— — .  (2.5.19) 

r\  n\ 

r=0 

Therefore,  Rolle's  theorem  (Theorem  2.3.8)  implies  that  h'(bi)  =  0  for  some  b\  in  7°; 
thus,  from  (2.5.19), 

I—1       r\  n\ 

r=0 

If  we  temporarily  write  /'  =  g,  this  becomes 

tu  \     n^S(r)(a)  (bx-af 

g{b\)-y   (bi-ay-K  -  =  0.  (2.5.20) 

^— '     r  n\ 

r=0 
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Since  b\  e  1°,  the  hypotheses  on  /  imply  that  g  is  continuous  on  the  closed  interval  / 
with  endpoints  a  and  b\,  g^  exists  on  J°,  and,  if  n  >  1,  g' ,  g*-"-1'  exist  and  are 
continuous  at  a  (also  at  b\,  but  this  is  not  important).  The  induction  hypothesis,  applied  to 
g  on  the  interval  /,  implies  that 

^  g(r)(a)  g{n)(c) 

r=0 

for  some  c  in  J°.  Comparing  this  with  (2.5.20)  and  recalling  that  g  =  f  yields 

K  =  g{n){c)  =  fin+1\c). 
Since  c  is  in  1°,  this  completes  the  induction.  H 

2.5  Exercises 


1.  Let 

/<Hr'  Hi 

Show  that  /  has  derivatives  of  all  orders  on  (— oo,  oo)  and  every  Taylor  polynomial 
off  about  0  is  identically  zero.  Hint:  See  Exercise  2.4.40. 

2.  Suppose  that  f(-n+1^(xo)  exists,  and  let  T„  be  the  «th  Taylor  polynomial  of  /  about 
xo .  Show  that  the  function 

(f(x)-Tn(x) 

^  /  n   ,    x  e  D  f  —  {xq  \, 

E„(x)  =  I  (x-xo)n 

\0,  x  =  x0, 

is  differentiable  at  xo,  and  find  E'n(xo). 

3.  (a)   Prove:  If  /  is  continuous  at  Xq  and  there  are  constants  ao  and  a\  such  that 

f(x)-a0-a1(x-x0) 
hm    =  0, 

x^xq  X  —  Xq 

then  ao  =  f  (xq),  f  is  differentiable  at  xq,  and  f'(xo)  =  ci\ . 
(b)    Give  a  counterexample  to  the  following  statement:  If  /  and  /'  are  continuous 
at  xq  and  there  are  constants  ao,  a\,  and  a-i  such  that 

fix)  -aa  -a\{x  -x0)  -  a2(x  -  x0)2 
lim   =  0, 

x^x0  ix  —  Xo)2 

then  /"(xo)  exists. 

4.  (a)    Prove:  if  f"(xo)  exists,  then 

,.     fix0+h)-2fix0)  + fixo-h) 

hm  —   =  /  (x0). 

h~>0  hz 
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(b)    Prove  or  give  a  counterexample:  If  the  limit  in  (a)  exists,  then  so  does 
f"(xo),  and  they  are  equal. 

5.  A  function  /  has  a  simple  zero  (or  a  zero  of  multiplicity  1)  at  xq  if  /  is  differentiable 
in  a  neighborhood  of  xq  and  / (xq)  =  0,  while  f'(xo)  ^  0. 

(a)  Prove  that  /  has  a  simple  zero  at  xq  if  and  only  if 

f(x)  =  g{x){x  -  Xq), 

where  g  is  continuous  at  xq  and  differentiable  on  a  deleted  neighborhood  of 
x0,  and  g(x0)  ^  0. 

(b)  Give  an  example  showing  that  g  in(a)  need  not  be  differentiable  at  xq. 

6.  A  function  /  has  a  double  zero  (or  a  zero  of  multiplicity  2)  at  xq  if  /  is  twice  dif- 
ferentiable on  a  neighborhood  of  .To  and  f(xo)  =  f'(xo)  =  0,  while  f"(xo)  ^  0. 

(a)  Prove  that  /  has  a  double  zero  at  xq  if  and  only  if 

f{x)  =  g(x)(x-x0)2, 

where  g  is  continuous  at  xq  and  twice  differentiable  on  a  deleted  neighborhood 
of  xq,  g(xo)  ^=  0,  and 

lim  (x  —  xo)g'(x)  =  0. 

(b)  Give  an  example  showing  that  g  in(a)  need  not  be  differentiable  at  xq. 

7.  Let  n  be  a  positive  integer.  A  function  /  has  a  zero  of  multiplicity  n  at  xq  if  / 
is  n  times  differentiable  on  a  neighborhood  of  xq,  f(xo)  =  f'(xo)  =  ■■■  = 
y («— i) (jco)  =  0  and  /^(xq)  ^  0.  Prove  that  /  has  a  zero  of  multiplicity  n  at 
xq  if  and  only  if 

f(x)  =  g(x)(x-x0y, 

where  g  is  continuous  at  xq  and  n  times  differentiable  on  a  deleted  neighborhood  of 
x0,  g(x0)  ^  0,  and 

lim  (x  -  x0)'  g(i){x)  =  0,     1  <  j  <  n  -  1. 

X~*XQ 

HINT:  Use  Exercise  2.5.6  and  induction. 

8.  (a)  Let 

Q{x)  =  a0  +  ai(x  -  x0)  H  hffnfjc  -  x0)n 

be  a  polynomial  of  degree  <  n  such  that 

lim  —  =  0. 

x^xq  (x  —  X0)n 

Show  that  do  =  ai  =  •  -  •  =  a„  =  0. 
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(b)    Suppose  that  /  is  n  times  differentiable  at  Xq  and  p  is  a  polynomial 

p(x)  =  ao  +ai(x  -xo)  H  h  <a„ (jc  —  x0)" 

of  degree  <  n  such  that 

lim  f^-P^  =  0. 

Show  that 

fir)(xo)    .,    n^  ^ 

ar  =   -          if    0  <  r  <  n; 

r\ 

that  is,  p  =  Tn,  the  nth  Taylor  polynomial  of  /  about  xq. 
9.    Show  that  if  fM  (x0)  and  gM  (x0)  exist  and 

hm  —  —  =  0, 

x->-*o    (x  —  x0)n 

then  /(r)(*o)  =  g{r)(x0),  0  <  r  <  n. 

10.  (a)   Let  Fn,  Gn,  and  Hn  be  the  nth  Taylor  polynomials  about  Xq  of  f,  g,  and 

their  product  h  =  fg.  Show  that  Hn  can  be  obtained  by  multiplying  Fn 
by  Gn  and  retaining  only  the  powers  of  x  —  xq  through  the  nth.  Hint:  Use 
Exercise  2.5.8(b). 

(b)    Use  the  method  suggested  by  (a)  to  compute  h^r\xo),  r  =  1,2,3,4. 
(i)   h(x)  =  ex  sinx,    xq  =  0 
(ii)   h(x)  =  (cos  jrx/2)(\ogx),    xq  =  1 

(iii)  h(x)  =  x2  cos  x,    xq  =  ti/2 

(iv)  h{x)  =  (1  +x)~1e~x,    x0  =  0 

11.  (a)   It  can  be  shown  that  if  g  is  n  times  differentiable  at  x  and  /  is  n  times  dif- 

ferentiable at  g(x),  then  the  composite  function  h(x)  =  f(g(x))  is  n  times 
differentiable  at  x  and 


where  VJr  is  over  all  n-tuples  (r\ ,  r^, . . . ,  rn)  of  nonnegative  integers  such  that 

r\  +  r2-\  Yrn  =  r 

and 

r\  +  2r2  H  \-nrn=n. 

(This  is  Faa  eft  Bruno's  formula).  However,  this  formula  is  quite  complicated. 
Justify  the  following  alternative  method  for  computing  the  derivatives  of  a 
composite  function  at  a  point  xq: 
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Let  Fn  be  the  «th  Taylor  polynomial  of  /  about  yo  =  g(xo),  and  let  G„  and 
Hn  be  the  nth  Taylor  polynomials  of  g  and  h  about  xq.  Show  that  Hn  can 
be  obtained  by  substituting  Gn  into  Fn  and  retaining  only  powers  of  x  —  xq 
through  the  nth.  Hint:  See  Exercise  2.5.8(b). 
(b)  Compute  the  first  four  derivatives  of  h(x)  =  cos(sinx)  at  xq  =  0,  using  the 
method  suggested  by  (a) . 

12.  (a)  If  g(xo)  7^  0  and  g^ixo)  exists,  then  the  reciprocal  h  =  1/g  is  also  n  times 
differentiable  at  xq,  by  Exercise  2.5.1 1  (a),  with  / (x)  =  1  /x.  Let  G„  and  Hn 
be  the  nth  Taylor  polynomials  of  g  and/j  aboutxo.  Use  Exercise  2.5.11(a)  to 
prove  that  if  g(xo )  =  1 ,  then  Hn  can  be  obtained  by  expanding  the  polynomial 

n 

r=l 

in  powers  of  x  —  xq  and  retaining  only  powers  through  the  nth. 

(b)  Use  the  method  of  (a)  to  compute  the  first  four  derivatives  of  the  following 
functions  at  xq  . 

(i)    h(x)  =  CSCX,      Xq  =  Jt/2 

(ii)   h(x)  =  (1  +  x  +  x2)-\    xQ  =  0 

(iii)  h(x)  =  secx,    xq  =  tt/4 

(iv)  h{x)  =  [1  +  log(l  +  x)]'1 ,    x0  =  0 

(c)  Use  Exercise  2.5.10  to  justify  the  following  alternative  procedure  for  obtaining 
Hn,  again  assuming  that  g(.xo)  =  L  If 

Gn(x)  =  1  +  di(x  -  Xq)  H  \-a„(x  -  Xq)" 

(where,  of  course,  ar  =  g^  (xq)  /  r\)  and 

Hn(x)  =  b0  +  bi(x  —  xq)  H  h  b„(x  -  x0)n , 


then 


%  =  1,    bk  =  -^,aTbk-T,  l<k<n. 


13.  Determine  whether  xq  =  0  is  a  local  maximum,  local  minimum,  or  neither, 
(a)  f(x)  =  x2ex3  (b)  f(x)  =  x3ex2 

(e)  /(x)  =  x2  sin3  x  +  x2  cosx  (f)  f(x)  =  ex  sinx 

(g)  /(x)  =  ex  sinx2  (h)  /(x)  =  ex2  cosx 

14.  Give  an  example  of  a  function  that  has  zero  derivatives  of  all  orders  at  a  local  mini- 
mum point. 
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15.     Find  the  critical  points  of 


x3  bx2 
f(x)  =  —  +  —  +cx  +  d 

and  identify  them  as  local  maxima,  local  minima,  or  neither. 
16.     Find  an  upper  bound  for  the  magnitude  of  the  error  in  the  approximation. 

(a)  sinjc  rs  x,  \x\  <  — 
v  '  1  1  20 

(b)  vT+7«i  +  |,  \x\<X- 


(c)  cos* 


1 


V2[l-(x-f)],  f<*<f 


5n 


(d)   logx^(x-l)  +  . 


Ijc  —  II  <  — 
1        1  64 


17.     Prove:  If 


r=0 


then 


1  - 


(n  +  1)! 


T»(x) 


T„(x)  <  T„+1(x)  <ex  < 

ifO  <  x  <  [{n  +  l)!]1/("+1). 
18.     The  forward  difference  operators  with  spacing  h  >  0  are  defined  by 

A0 f(x)  =  f(x),     A f(x)  =  fix  +  h)-  fix), 

An+1fix)  =  A[Anfix)],  n>\. 

(a)  Prove  by  induction  on  n:  If  k  >  2,  c\, Cfc  are  constants,  and  n  >  1,  then 

A"[ci/i(x)  +  ■  •  ■  +  cfcA(x)]  =  Cl  A"/i  W  +  •••  +  ckAn  fkix). 

(b)  Prove  by  induction:  If  «  >  1,  then 


m=0 


HINT:  See  Exercise  1.2.19. 
/n  Exercises  2.5.19-2.5.22,  A  z'i  the  forward  difference  operator  with  spacing  h  >  0. 
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19.     Let  m  and  n  be  nonnegative  integers,  and  let  xq  be  any  real  number.  Prove  by 
induction  on  n  that 


A"(x-x0)" 


0  if  0<m<«, 
n\h"    if   m  =  n. 


Does  this  suggest  an  analogy  between  "differencing"  and  differentiation? 
20.     Find  an  upper  bound  for  the  magnitude  of  the  error  in  the  approximation 

A2f(x0-h) 


/"(*o) 


h2 


(a)  assuming  that  /'"  is  bounded  on  {xq  —  h,  xq  +  h); 

(b)  assuming  that        is  bounded  on  (.To  —  h,  xq  +  h). 

21.  Let  /"'  be  bounded  on  an  open  interval  containing  xq  and  xq  +  2h.  Find  a  constant 
k  such  that  the  magnitude  of  the  error  in  the  approximation 

A/(s0)  ,  uA2f(x0) 

is  not  greater  than  Mh2,  where  M  =  sup  (|/'"(c)l  I  l-^o  <  c  <  Xo}. 

22.  Prove:  If  f{n+1)  is  bounded  on  an  open  interval  containing  Xq  and  Xq  +  nh,  then 


AV(*o)  Hn) 


fw(xo) 


h" 

where  An  is  a  constant  independent  of  /  and 


<  A„M„+1h, 


Mn+1  =       sup  \fin+1\c)\. 

xo<c<xo+nh 

Hint:  See  Exercises  2.5.18  and  2.5.19. 

23.  Suppose  that  exists  on  (a,  b),  xq,  . . . ,  xn  are  in  (a,  b),  and  p  is  the  polyno- 
mial of  degree  <  n  such  that  p(xi)  =  f(xt),  0  <  i  <  n.  Prove:  If  x  €  (a,b), 
then 

/■("+1)(c) 

/(x)  =  />(x)  +  - — — —(x  -  x0)(x  -xi)---(x-  x„), 
(n  +  1)! 

where  c,  which  depends  on  x,  is  in  (a,  b).  Hint:  Let  x  be  fixed,  distinct  from  xq, 
X\,       x„,  and  consider  the  function 

g(y)  =  f(y)  - p(y)  -  -( — — r^Xy  - *o)(y  -x{)---(y- %„), 

(n  +  1)! 

where  K  is  chosen  so  that  g(x)  =  0.  Use  Rolle's  theorem  to  show  that  K  = 
f (c)  for  some  c  in(a,b). 

24.  Deduce  Theorem  2.5.4  from  Theorem  2.5.5. 


CHAPTER  3 


Integral  Calculus  of 
Functions  of  One  Variable 


IN  THIS  CHAPTER  we  discuss  the  Riemann  on  a  finite  interval  [a,  b],  and  improper  inte- 
grals in  which  either  the  function  or  the  interval  of  integration  is  unbounded. 

SECTION  3.1  begins  with  the  definition  of  the  Riemann  integral  and  presents  the  geo- 
metrical interpretation  of  the  Riemann  integral  as  the  area  under  a  curve.  We  show  that 
an  unbounded  function  cannot  be  Riemann  integrable.  Then  we  define  upper  and  lower 
sums  and  upper  and  lower  integrals  of  a  bounded  function.  The  section  concludes  with  the 
definition  of  the  Riemann-Stieltjes  integral. 

SECTION  3.2  presents  necessary  and  sufficient  conditions  for  the  existence  of  the  Riemann 
integral  in  terms  of  upper  and  lower  sums  and  upper  and  lower  integrals.  We  show  that 
continuous  functions  and  bounded  monotonic  functions  are  Riemann  integrable. 

SECTION  3.3  begins  with  proofs  that  the  sum  and  product  of  Riemann  integrable  functions 
are  integrable,  and  that  |  / 1  is  Riemann  integrable  if  /  is  Riemann  integrable.  Other  topics 
covered  include  the  first  mean  value  theorem  for  integrals,  antiderivatives,  the  fundamental 
theorem  of  calculus,  change  of  variables,  integration  by  parts,  and  the  second  mean  value 
theorem  for  integrals. 

SECTION  3.4  presents  a  comprehensive  discussion  of  improper  integrals.  Concepts  de- 
fined and  considered  include  absolute  and  conditional  convergence  of  an  improper  integral, 
Dirichlet's  test,  and  change  of  variable  in  an  improper  integral. 

SECTION  3.5  defines  the  notion  of  a  set  with  Lebesgue  measure  zero,  and  presents  a 
necessary  and  sufficient  condition  for  a  bounded  function  /  to  be  Riemann  integrable  on 
an  interval  [a,  b];  namely,  that  the  discontinuities  of  /  form  a  set  with  Lebesgue  masure 
zero. 


3.1  DEFINITION  OF  THE  INTEGRAL 

The  integral  that  you  studied  in  calculus  is  the  Riemann  integral,  named  after  the  German 
mathematician  Bernhard  Riemann,  who  provided  a  rigorous  formulation  of  the  integral  to 
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replace  the  intuitive  notion  due  to  Newton  and  Leibniz.  Since  Riemann's  time,  other  kinds 
of  integrals  have  been  defined  and  studied;  however,  they  are  all  generalizations  of  the 
Riemann  integral,  and  it  is  hardly  possible  to  understand  them  or  appreciate  the  reasons  for 
developing  them  without  a  thorough  understanding  of  the  Riemann  integral.  In  this  section 
we  deal  with  functions  defined  on  a  finite  interval  [a,b].  A  partition  of  [a,  b]  is  a  set  of 
subintervals 

[x0,xi],  [xi,X2],-..,[x„-i,x„],  (3.1.1) 

where 

a  =  xq  <  x\  ■  ■  ■  <  xn  =  b,  (3.1.2) 

Thus,  any  set  of  n  +  1  points  satisfying  (3.1.2)  defines  a  partition  P  of  [a,  b],  which  we 
denote  by 

P  =  {x0,xi,...,xn}. 

The  points  xq,  x\,  xn  are  the  partition  points  of  P.  The  largest  of  the  lengths  of  the 
subintervals  (3.1.1)  is  the  norm  of  P ,  written  as  ||  P  || ;  thus, 

H^ll  =  max  (x,  —  Xi-i). 

\<i<n 

If  P  and  P'  are  partitions  of  [a,b],  then  P'  is  a  refinement  of  P  if  every  partition  point 
of  P  is  also  a  partition  point  of  P';  that  is,  if  P'  is  obtained  by  inserting  additional  points 
between  those  of  P .  If  /  is  defined  on  [a,b],  then  a  sum 

n 

P  =  J2  f(Cj)(Xj  -Xj-i), 
./  =  ! 

where 

Xj-i  <  Cj  <  Xj,     1  <  j  <  n, 

is  a  Riemann  sum  of  f  over  the  partition  P  =  {xq,  Xi,  . . . ,  xn}.  (Occasionally  we  will  say 
more  simply  that  a  is  a  Riemann  sum  of  /  over  [a,  b\.)  Since  cy  can  be  chosen  arbitrarily 
in  [xj,  Xj—i],  there  are  infinitely  many  Riemann  sums  for  a  given  function  /  over  a  given 
partition  P . 

Definition  3.1.1  Let  /  be  defined  on  [a,  b].  We  say  that  /  is  Riemann  integrable  on 
[a,  b]  if  there  is  a  number  L  with  the  following  property:  For  every  e  >  0,  there  is  a  S  >  0 
such  that 

\a-L\  <  e 

if  a  is  any  Riemann  sum  of  /  over  a  partition  P  of  [a,  b]  such  that  ||  P  ||  <  S.  In  this  case, 
we  say  that  L  is  the  Riemann  integral  of  f  over  [a,b],  and  write 

J    f  (x)  dx  =  L. 

J  a 
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We  leave  it  to  you  (Exercise  3. 1 . 1)  to  show  that  f  (x)  dx  is  unique,  if  it  exists;  that  is, 
there  cannot  be  more  than  one  number  L  that  satisfies  Definition  3.1.1. 

For  brevity  we  will  say  "integrable"  and  "integral"  when  we  mean  "Riemann  integrable" 
and  "Riemann  integral."  Saying  that  f  f  (x)  dx  exists  is  equivalent  to  saying  that  /  is 
integrable  on  [a,  b\. 

Example  3.1.1  If 

f(x)  =  1,    a  <  x  <  b, 

then 

n  n 

f(cj)(xj       =  -xj-i). 

7=1  7=1 

Most  of  the  terms  in  the  sum  on  the  right  cancel  in  pairs;  that  is, 
n 

^2(xj  -  xj-i)  =  (xi  -  x0)  +  (x2  —  x\)  H  h  (x„  -  xn-i) 

j=i 

=  -X0  +  (Xy  -  X\)  +  (x2-X2)-\  h  (X„-1  -  Xn-i)  +  X„ 

=  Xn  —  Xo 

=  b  —  a. 

Thus,  every  Riemann  sum  of  /  over  any  partition  of  [a,  b]  equals  b  —  a,  so 

dx  =  b  —  a. 


I 

J  a 


Example  3.1.2  Riemann  sums  for  the  function 

f(x)  =  x,    a  <  x  <  b, 

are  of  the  form 

n 

a  =  ^Cjixj  -  xj-i).  (3.1.3) 

7  =  1 

Since  Xj-\  <  Cj  <  Xj  and  (Xj  +  Xj-\)/2  is  the  midpoint  of  [xj—i,  Xj],  we  can  write 

Xi  +  Xt-i 

Cj=    3    2  1     +dj,  (3.1.4) 

where 

,  ,  ,       Xj  —  Xj-i        II P  || 

\dj\ <   J   2J  (3.1.5) 
Substituting  (3.1.4)  into  (3.1.3)  yields 

yi  n 

Xj  +  Xj-i 


7=1  7=1 
j     n  n 

7=1  7=1 


(3.1.6) 
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Because  of  cancellations  like  those  in  Example  3.1.1, 


so  (3.1.6)  can  be  rewritten  as 


E(*/-*y-i)  =  * 


2      „2  » 


2  2 

~    a  , 


bl-a 


.7  =  1 


Hence, 


b2-a2 


a  — 


 ^  II  jj  ||   ^ 

<  2_,  \dj\(xj  -Xj-i)  <  — -  ^2(xj  -xj-i)    (see  (3.1.5)) 


7=1 

IIPI 


7=1 


-(b-a). 


Therefore,  every  Riemann  sum  of  /  over  a  partition  P  of  [a,  b]  satisfies 


b2-a2\  „  „  „  2e 
a  <e    if      P   <8  =  -  


Hence, 


b  b2_a2 

x  a  x  =   . 


The  Integral  as  the  Area  Under  a  Curve 

An  important  application  of  the  integral,  indeed,  the  one  invariably  used  to  motivate  its 
definition,  is  the  computation  of  the  area  bounded  by  a  curve  y  =  f{x),  the  x-axis,  and 
the  lines  x  =  a  and  x  =  b  ("the  area  under  the  curve"),  as  in  Figure  3.1.1. 

y 

i 


y=f(x) 


Figure  3.1.1 
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For  simplicity,  suppose  that  / (x)  >  0.  Then  / (cj)(xj  —  xj-i)  is  the  area  of  a  rectangle 
with  base  xj  —  xj-\  and  height  / (c,),  so  the  Riemann  sum 

n 

X!  f(cj)(xj  -xj-i) 

./  =  ! 

can  be  interpreted  as  the  sum  of  the  areas  of  rectangles  related  to  the  curve  y  =  f  (x),  as 
shown  in  Figure  3.1.2. 


 1  1  1  1  1  1  1  1  *~  x 

a     c,    x,        c2    x2   c3     x3  c4  b 

Figure  3.1.2 


An  apparently  plausible  argument,  that  the  Riemann  sums  approximate  the  area  under 
the  curve  more  and  more  closely  as  the  number  of  rectangles  increases  and  the  largest  of 
their  widths  is  made  smaller,  seems  to  support  the  assertion  that  f  f  (x)  dx  equals  the 
area  under  the  curve.  This  argument  is  useful  as  a  motivation  for  Definition  3.1.1,  which 
without  it  would  seem  mysterious.  Nevertheless,  the  logic  is  incorrect,  since  it  is  based 
on  the  assumption  that  the  area  under  the  curve  has  been  previously  defined  in  some  other 
way.  Although  this  is  true  for  certain  curves  such  as,  for  example,  those  consisting  of  line 
segments  or  circular  arcs,  it  is  not  true  in  general.  In  fact,  the  area  under  a  more  complicated 
curve  is  defined  to  be  equal  to  the  integral,  if  the  integral  exists.  That  this  new  definition  is 
consistent  with  the  old  one,  where  the  latter  applies,  is  evidence  that  the  integral  provides 
a  useful  generalization  of  the  definition  of  area. 

Example  3.1.3  Let  f(x)  =  x,  1  <  x  <  2  (Figure  3.1.3,  page  118).  The  region  under 
the  curve  consists  of  a  square  of  unit  area,  surmounted  by  a  triangle  of  area  1/2;  thus,  the 
area  of  the  region  is  3/2.  From  Example  3.1.2, 

j\dx=l-(22-\2)=3-. 
so  the  integral  equals  the  area  under  the  curve. 
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Figure  3.1.3 


y 
i 


Example  3.1.4  If 

(Figure  3.1.4),  then 


Figure  3.1.4 

f{x)  =  x2,     1  <  x  <  2 


1 


7 


j  f(x)dx  =  -(2*-li)  =  - 


(Exercise  3.1.4),  so  we  say  that  the  area  under  the  curve  is  7/3.  However,  this  is  the  defini- 
tion of  the  area  rather  than  a  confirmation  of  a  previously  known  fact,  as  in  Example  3.1.3. 
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Theorem  3.1.2  Iff  is  unbounded  on  [a,  b],  then  f  is  not  integrable  on  [a,  b]. 

Proof  We  will  show  that  if  /  is  unbounded  on  [a,  b],  P  is  any  partition  of  [a,  b],  and 
M  >  0,  then  there  are  Riemann  sums  a  and  a'  of  /  over  P  such  that 

\a-o'\>M.  (3.1.7) 

We  leave  it  to  you  (Exercise  3.1.2)  to  complete  the  proof  by  showing  from  this  that  / 
cannot  satisfy  Definition  3.1.1. 

Let 

n 

a  =       f(cj)(xj  -Xj-i) 

./  =  ! 

be  a  Riemann  sum  of  /  over  a  partition  P  of  [a,b].  There  must  be  an  integer  i  in 
{1,2, ...  ,n}  such  that 

\f(c)-f(d)\>      —  (3.1.8) 
for  some  c  in  [x,-_iXj],  because  if  there  were  not  so,  we  would  have 

M 

\f(x)  -  f(cj)\  <  —  ,    xj-i  <x<xj,  l<j<n. 

^  j        %j  —  1 


Then 


\f(x)\  =  \f(Cj)  +  f{x)  -  f(Cj)\  <  \f(Cj)\  +  \f(x)  -  f(Cj)\ 


M 

\f(cj)\  +  —  ,    Xj-i<x<Xj,     1  <j<n. 

X  j       X  j  —  \ 


which  implies  that 


M 

\f(x)\  <  max  \f(cj)\  +  ,    a  <  X  <  b 

l<J<n  Xj  -  Xj-i 

contradicting  the  assumption  that  /  is  unbounded  on  [a,  b]. 

Now  suppose  that  c  satisfies  (3.1.8),  and  consider  the  Riemann  sum 

n 

°'  =  T,f(c'jXxJ-xJ-^ 


over  the  same  partition  P ,  where 
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Since 


\o-o'\  =  \f(c)-f(ct)\(xi  -Xt-l), 


(3.1.8)  implies  (3.1.7). 


H 


Upper  and  Lower  Integrals 

Because  of  Theorem  3.1.2,  we  consider  only  bounded  functions  throughout  the  rest  of  this 
section. 

To  prove  directly  from  Definition  3.1.1  that  f  (x)  dx  exists,  it  is  necessary  to  discover 
its  value  L  in  one  way  or  another  and  to  show  that  L  has  the  properties  required  by  the 
definition.  For  a  specific  function  it  may  happen  that  this  can  be  done  by  straightforward 
calculation,  as  in  Examples  3.1.1  and  3. 1 .2.  However,  this  is  not  so  if  the  objective  is  to  find 
general  conditions  which  imply  that  f  f  (x)  dx  exists.  The  following  approach  avoids  the 
difficulty  of  having  to  discover  L  in  advance,  without  knowing  whether  it  exists  in  the  first 
place,  and  requires  only  that  we  compare  two  numbers  that  must  exist  if  /  is  bounded  on 
[a,  b\.  We  will  see  that  J  f  (x)  dx  exists  if  and  only  if  these  two  numbers  are  equal. 

Definition  3.1.3  If  /  is  bounded  on  [a,  b]  and  P  =  {xq,  X\, . . . ,  xn}  is  a  partition  of 


[a,  6],  let 


sup  f(x) 


and 


inf  f(x). 

Xj—i <X<X J 


The  upper  sum  of  f  over  P  is 


n 


S{P)  =  YJMj{xj-xj-{), 


and  the  upper  integral  of  f  over,  [a,b],  denoted  by 


is  the  infimum  of  all  upper  sums.  The  lower  sum  of  f  over  P  is 


n 


and  the  lower  integral  of  f  over  [a,b],  denoted  by 


is  the  supremum  of  all  lower  sums. 
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If  m  <  f  (x)  <  M  for  all  x  in  [a,  b],  then 

m(b  -a)  <  s(P)  <  S(P)  <  M{b  -  a) 

for  every  partition  P;  thus,  the  set  of  upper  sums  of  /  over  all  partitions  P  of  [a,  b]  is 
bounded,  as  is  the  set  of  lower  sums.  Therefore,  Theorems  1.1.3  and  1.1.8  imply  that 

f  fix)  dx  and  f  fix)  dx  exist,  are  unique,  and  satisfy  the  inequalities 


m(b-a)<  f  f(x)dx  <  M(b  -  a) 

J  a 


and 


■8)<  f 
J  a 


m{b-a)<  /  f{x)dx<M{b-a). 


Theorem  3.1.4  Let  f  be  bounded  on  [a,  b],  and  let  P  be  a  partition  of  [a,  b].  Then 

(a)  The  upper  sum  S(P)  of  f  over  P  is  the  supremum  of  the  set  of  all  Riemann  sums  of 
f  over  P. 

(b)  The  lower  sum  s(P)  of  f  over  P  is  the  infimum  of  the  set  of  all  Riemann  sums  of  f 
over  P. 

Proof    (a)  If  P  =  {x0,  x\, . . . ,  xn),  then 

n 

S(P)  =  £M/(*y-JC/-i), 
7  =  1 

where 

Mj  =      sup  f(x). 

Xj  —  \  <X<Xj 

An  arbitrary  Riemann  sum  of  /  over  P  is  of  the  form 

n 

o  =  J2  f(cj)(xJ  -xj-i), 
j=l 

where  Xj-i  <  Cj  <  Xj.  Since  f(cj)  <  Mj ,  it  follows  that  a  <  S(P). 
Now  let  e  >  0  and  choose  Cj  in  [xj-\,  x j]  so  that 

ficj)>Mj-—  -,  l<j<n. 

nixj  -  xj-i) 

The  Riemann  sum  produced  in  this  way  is 

ixj-Xj-1)  =  SiP)-e. 


a  =  Y^  f(c.o(xJ  -xj-i)  > 
j=i  j=i 


Mj  -  -) 

nixj  -  xj-i) 


Now  Theorem  1.1.3  implies  that  S(P)  is  the  supremum  of  the  set  of  Riemann  sums  of  / 
over  P. 

(b)  Exercise  3.1.7.  U 
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Example  3.1.5  Let 


0     if  x  is  irrational, 
if  x  is  rational, 


and  P  =  {xo,  x\, . . . ,  xn)  be  a  partition  of  [a,  b].  Since  every  interval  contains  both  ratio- 
nal and  irrational  numbers  (Theorems  1.1.6  and  1.1.7), 

i7i  j  =  0    and     Mj  =  1,     1  <  j  <  n. 

Hence, 

n 

S(P)  =  J21'(xi~  XJ~^  =  b~a 

7  =  1 

and 

n 

j(P)  =  ^0-(xy-x7_1)  =  0. 

7  =  1 

Since  all  upper  sums  equal  b  —  a  and  all  lower  sums  equal  0,  Definition  3.1.3  implies  that 
I   f(x)dx  =  b  —  a    and      /   / (x)  dx  =  0. 

J  a  J  a 

Example  3.1.6  Let  /  be  defined  on  [1,  2]  by  f(x)  =  0  if  x  is  irrational  and  f(p/q)  = 
l/q  if  p  and  q  are  positive  integers  with  no  common  factors  (Exercise  2.2.7).  If  P  = 
{xq,  xi,  . . . ,  Xn}  is  any  partition  of  [1,  2],  then  mj  =  0,  1  <  j  <  n,  so  s(P)  =  0;  hence, 

/ (x)  dx  =  0. 


j  l 

We  now  show  that 


2 

f(x)dx  =  0  (3.1.9) 


'i 

also.  Since  S(i3)  >  0  for  every  P ,  Definition  3.1.3  implies  that 


1-2 

f(x)  dx  >  0, 

so  we  need  only  show  that 

f(x)  dx  <  0, 

n 

which  will  follow  if  we  show  that  no  positive  number  is  less  than  every  upper  sum.  To  this 
end,  we  observe  that  if  0  <  e  <  2,  then  / (x)  >  e/2  for  only  finitely  many  values  of  x  in 
[1.2]. 

Let  k  be  the  number  of  such  points  and  let  Pq  be  a  partition  of  [1.  2]  such  that 

||P0||<^.  (3-1.10) 
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Consider  the  upper  sum 

n 

S(P0)  =  YJMj{xj-Xj-l). 

7=1 

There  are  at  most  k  values  of  j  in  this  sum  for  which  M j  >  e/2,  and  Mj  <  1  even  for 
these.  The  contribution  of  these  terms  to  the  sum  is  less  than  k(e/2k)  =  e/2,  because  of 
(3.1.10).  Since  My  <  e/2  for  all  other  values  of  j ,  the  sum  of  the  other  terms  is  less  than 

e   n  e 

2  Y^(xi  ~         =         ~  Xo)  =  2(2  ~V)  =  2' 

7=1 

Therefore,  S(Po)  <  e  and,  since  e  can  be  chosen  as  small  as  we  wish,  no  positive  number 
is  less  than  all  upper  sums.  This  proves  (3.1.9).  ■ 

The  motivation  for  Definition  3.1.3  can  be  seen  by  again  considering  the  idea  of  area 
under  a  curve.  Figure  3.1.5  shows  the  graph  of  a  positive  function  y  =  f  (x),  a  <  x  <  b, 
with  [a,  b]  partitioned  into  four  subintervals. 


a  x2      x3  b 

Figure  3.1.5 


The  upper  and  lower  sums  of  /  over  this  partition  can  be  interpreted  as  the  sums  of  the  areas 
of  the  rectangles  surmounted  by  the  solid  and  dashed  lines,  respectively.  This  indicates  that 
a  sensible  definition  of  area  A  under  the  curve  must  admit  the  inequalities 

s(P)  SA<  S(P) 

for  every  partition  P  of  [a,  b] .  Thus,  A  must  be  an  upper  bound  for  all  lower  sums  and  a 
lower  bound  for  all  upper  sums  of  /  over  partitions  of  [a,b].  If 
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there  is  only  one  number,  the  common  value  of  the  upper  and  lower  integrals,  with  this 
property,  and  we  define  A  to  be  that  number;  if  (3. 1 . 1 1)  does  not  hold,  then  A  is  not  defined. 
We  will  see  below  that  this  definition  of  area  is  consistent  with  the  definition  stated  earlier 
in  terms  of  Riemann  sums. 

Example  3.1.7  Returning  to  Example  3.1.3,  consider  the  function 

f(x)=x,     1  <  x  <  2. 
If  P  =  {xq  ,  X\ , . . . ,  xn}  is  a  partition  of  [  1 ,  2] ,  then,  since  /  is  increasing, 
Mj  =  f(xj)  =  Xj    and    rrij  =  f(xj-\)  =  xj-\. 


Hence, 


and 


By  writing 


S(P)  =  Y,Xj(xj-Xj-i)  (3-1-12) 
j=i 


,(/')- ^\v,  ilAv-.v,  (3.1.13) 


X  f  ~\~  X  j  —  x        X  i       X  i  —  \ 
Xj  =  —  1  . 


we  see  from  (3.1.12)  that 

S(P)  =  \  -  x)_x)  +  \  j^ixj  -  xj^f 


2  ^  1  2 


7      I  \   ;   —    \   ;      i  1 


■xj-iY 


^(22-i2)  +  ^E(^ 

Since 

n  n 

0  <  -Xj-i)2  <\\P\\  T,(XJ-XJ-^  =  Wp\\(2~  !)> 

7=1  7=1 

(3.1.14)  implies  that 

3  3      HP  || 


2       v   '  ~  2  2 
Since  ||  P  ||  can  be  made  as  small  as  we  please,  Definition  3.1.3  implies  that 

fb  3 
J   f(x)dx  =  -. 

A  similar  argument  starting  from  (3.1.13)  shows  that 


(3.1.14) 
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so 

rb 


J  a 


3 

f(x)dx  =  -. 


Since  the  upper  and  lower  integrals  both  equal  3/2,  the  area  under  the  curve  is  3/2  accord- 
ing to  our  new  definition.  This  is  consistent  with  the  result  in  Example  3.1.3. 


The  Riemann  Stieltjes  Integral 

The  Riemann-Stieltjes  integral  is  an  important  generalization  of  the  Riemann  integral.  We 
define  it  here,  but  confine  our  study  of  it  to  the  exercises  in  this  and  other  sections  of  this 
chapter. 

Definition  3.1.5  Let  /  and  g  be  defined  on  [a,  b].  We  say  that  /  is  Riemann-Stieltjes 
integrable  with  respect  to  g  on  [a,  b]  if  there  is  a  number  L  with  the  following  property: 
For  every  e  >  0,  there  is  a  S  >  0  such  that 


<  €,  (3.1.15) 


provided  only  that  P  =  {xq,  x\,  . . . ,  xn}  is  a  partition  of  [a,  b]  such  that  \\P\\  <  S  and 

xj-i  <  Cj  <  Xj,    j  =  1,2, . . .,n. 

In  this  case,  we  say  that  L  is  the  Riemann-Stieltjes  integral  of  f  with  respect  to  g  over 
[a,b],  and  write 

cb 

f(x)dg(x)  =  L. 


f 

J  a 


The  sum 

n 

J2f(cj)[g(xj)-g(xj-1)] 

.7=1 

in  (3.1.15)  is  a  Riemann-Stieltjes  sum  of  f  with  respect  to  g  over  the  partition  P . 

3.1  Exercises 


1.  Show  that  there  cannot  be  more  than  one  number  L  that  satisfies  Definition  3.1.1. 

2.  (a)   Prove:  If     f(x)  dx  exists,  then  for  every  e  >  0,  there  is  a  8  >  0  such  that 

| o"i  —  0"2 1  <  e  if  cti  and  02  are  Riemann  sums  of  /  over  partitions  P\  and  P2 
of  [a,  b]  with  norms  less  than  S. 
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(b)  Suppose  that  there  is  an  M  >  0  such  that,  for  every  S  >  0,  there  are  Riemann 
sums  d\  and  02  over  a  partition  P  of  [a,  b]  with  \\P\\  <  S  such  that  | cri  —  o"2 1  > 
M.  Use  (a)  to  prove  that  /  is  not  integrable  over  [a,  b], 

3.  Suppose  that  f  f  (x)  dx  exists  and  there  is  a  number  A  such  that,  for  every  e  >  0 
and  S  >  0,  there  is  a  partition  P  of  [a,  b]  with  ||  P  \\  <  S  and  a  Riemann  sum  a  of  f 
over  P  that  satisfies  the  inequality  \a  —  A\  <  e.  Show  that  f£  f  (x)  dx  =  A. 

4.  Prove  directly  from  Definition  3.1.1  that 

cb  u3  „3 


/ 

J  a 


x  dx 


Do  not  assume  in  advance  that  the  integral  exists.  The  proof  of  this  is  part  of  the 
problem.  HINT:  Let  P  =  {xq,  x-i,  . . . ,  x„)  be  an  arbitrary  partition  of  [a,b\.  Use 
the  mean  value  theorem  to  show  that 


3  J-l 


for  some  points  d\,  d„,  where  Xj—i  <  dj  <  Xj.  Then  relate  this  sum  to 
arbitrary  Riemann  sums  for  f  (x)  =  x2  over  P. 

5.     Generalize  the  proof  of  Exercise  3. 1 .4  to  show  directly  from  Definition  3.1.1  that 


L 


b  um+\ 

xmdx  = 


m  +  1 
if  m  is  an  integer  >  0. 

Prove  directly  from  Definition  3.1.1  that  f(x)  is  integrable  on  [a,  b]  if  and  only  if 
f(—x)  is  integrable  on  [— b,  —a],  and,  in  this  case, 

-b 


po  r—a 

I    f(x)dx=  I  f(—x)dx. 

Ja    '  J-b 


7.  Let  /  be  bounded  on  [a,  b]  and  let  P  be  a  partition  of  [a,b].  Prove:  The  lower  sum 
s(P)  of  f  over  P  is  the  infimum  of  the  set  of  all  Riemann  sums  of  /  over  P . 

8.  Let  /  be  defined  on  [a,  b]  and  let  P  =  {xq,  X\, . . . ,  xn}  be  a  partition  of  [a,  b\. 

(a)  Prove:  If  /  is  continuous  on  [a,b],  then  s(P)  and  S(P)  are  Riemann  sums  of 
/  over  P . 

(b)  Name  another  class  of  functions  for  which  the  conclusion  of  (a)  is  valid. 

(c)  Give  an  example  where  s(P)  and  S(P)  are  not  Riemann  sums  of  /  over  P . 
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Find  Jq  f  (x)  dx  and  Jq  f  (x)  dx  if 


/  \  f/  \  _  \    x     if  *  is  rational,         ,,  ..  f(^_\  1     if  JC  is  rational, 
^  '  |  —x     if  x  is  irrational.       \  )  J  \  >      \x     if^is  irrational. 

10.  Given  that  f  ex  dx  exists,  evaluate  it  by  using  the  formula 

1  _  rn+l 

l  +  r  +  r2  +  ---  +  rn  =  —   (r  ^  1) 

1  —  r 

to  calculate  certain  Riemann  sums.  Hint:  See  Exercise  3.1.3. 

11.  Given  that  L  sinx  dx  exists,  evaluate  it  by  using  the  identity 

cos(y  —  1)6  —  cos(y  +  1)6  =  2sin#sin  jd 

to  calculate  certain  Riemann  sums.  Hint:  See  Exercise  3.1.3. 
•b 
o 

sin(y  +  1)6*  —  sin(y'  —  1)6  =  2  sin  6  cos  j6 


12.     Given  that  /0  cos  x  dx  exists,  evaluate  it  by  using  the  identity 


to  calculate  certain  Riemann  sums.  Hint:  See  Exercise  3.1.3. 

13.  Show  that  if  g(x)  =  x  +  c  (c=constant),  then  f(x)  dg(x)  exists  if  and  only  if 
f  f  (x)  dx  exists,  in  which  case 

pb  i>b 

/    f(x)dg(x)  =  /  f(x)dx. 

J  a  J  a 

14.  Suppose  that  — oo  <a<d<c<oo  and 

.  .      \g\,    a  <  x  <  d,  , 
g(x)=\g2,    d<x<b,   (^1^2  =  constants), 

and  let  g(a),  g(b),  and  g(d)  be  arbitrary.   Suppose  that  /  is  defined  on  [a,b], 
continuous  from  the  right  at  a  and  from  the  left  at  b,  and  continuous  at  d .  Show  that 
f  (x)  dg(x)  exists,  and  find  its  value. 

15.  Suppose  that  — oo  <  a  =  ao  <  ci\  <  ■■■  <  ap  =  b  <  oo,  let  g(x)  =  gm 
(constant)  on  (am-\ ,  am),  \<m<  p,  and  let  g(ao),  g{a\ ),•••,  giflp)  be  arbitrary. 
Suppose  that  /  is  defined  on  [a,  b],  continuous  from  the  right  at  a  and  from  the 
left  at  b,  and  continuous  at  a\,  ■  ■  ■ ,  ap~\ .  Evaluate  f  f  (x)  dg{x).  Hint:  See 
Exercise  3.1.14. 

16.  (a)   Give  an  example  where      f(x)dg(x)  exists  even  though  /  is  unbounded 

on  [a,  b].  (Thus,  the  analog  of  Theorem  3.1.2  does  not  hold  for  the  Riemann- 
Stieltjes  integral.) 

(b)    State  and  prove  an  analog  of  Theorem  3.1 .2  for  the  case  where  g  is  increasing. 
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17.     For  the  case  where  g  is  nondecreasing  and  /  is  bounded  on  [a,  b],  define  upper  and 
lower  Riemann-Stieltjes  integrals  in  a  way  analogous  to  Definition  3.1.3. 

3.2  EXISTENCE  OF  THE  INTEGRAL 

The  following  lemma  is  the  starting  point  for  our  study  of  the  integrability  of  a  bounded 
function  /  on  a  closed  interval  [a,  b]. 

Lemma  3.2.1  Suppose  that 

|/(jc)|<M,    a<x<b,  (3.2.1) 

and  let  P'  be  a  partition  of  [a,  b]  obtained  by  adding  r  points  to  a  partition  P  =  {xq,  Xi, . . . ,  Xn} 
of  [a,  b].  Then 

S(P)>S(P')>S(P)-2Mr\\P\\  (3.2.2) 

and 

s(P)  <  s(P')  <  s(P)  +  2Mr\\P\\.  (3.2.3) 

Proof  We  will  prove  (3.2.2)  and  leave  the  proof  of  (3.2.3)  to  you  (Exercise  3.2.1). 
First  suppose  that  r  =  1,  so  P'  is  obtained  by  adding  one  point  c  to  the  partition  P  = 
{xq,  Xi,  . . . ,  xn);  then  .y,_i  <  c  <  xt  for  some  i  in  {1, 2, . . . ,  n}.  If  j  ^  i,  the  prod- 
uct Mj{xj  —  Xj-\)  appears  in  both  S(P)  and  S(P')  and  cancels  out  of  the  difference 
S(P)  -  S(P').  Therefore,  if 

Mn  =     sup     f(x)    and     Mi%  =    sup  fix), 

Xj  —  l<X<C  C<X<Xj 

then 

SOP)  -  S(P')  =  Mi(Xi  -  xt-i)  -  Mn(c  -  jc,_i)  -  Mi2(Xi  -  c) 

(3.2.4) 

=  {Mi  -  Mn)(c  -  Xi-i)  +  (Mi  -  Mi2)(xi  -  c). 
Since  (3.2.1)  implies  that 

0  <  Mi  -  Mir  <  2M,    r  =  1,2, 

(3.2.4)  implies  that 

0  <  S(P)  -  S(P')  <  2M(xi  -  Xi-!)  <  2M||P||. 

This  proves  (3.2.2)  for  r  =  \. 

Now  suppose  that  r  >  1  and  P'  is  obtained  by  adding  points  c\,  c2,  ■  ■  ■ ,  cr  to  P .  Let 
p(o)  _  p  an(j^  for  y  >  i5  let  Pw)  be  the  partition  of  [a,  b]  obtained  by  adding  Cj  to 
p{j-i)  xhen  the  result  just  proved  implies  that 

0<  S{Pu~l))  -S(P0))  <  2M\\PU~1}\\,     1  <j  <  r. 
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Adding  these  inequalities  and  taking  account  of  cancellations  yields 

0<  S(P{0))-  S(P(r))  <  2M(\\P{0)\\  +  \\P{1)\\  +■■■+  ||JP(r-1)||).  (3.2.5) 

Since  f(0)  =  P,  P^r)  =  P' ,  and  \\P{k)\\  <  ||P(A:_1)||  fori  <  k  <  r  -  1,  (3.2.5)  implies 
that 

0<  S(P)-S(P')  <2Mr\\P\\, 
which  is  equivalent  to  (3.2.2).  H 

Theorem  3.2.2  Iff  is  bounded  on  [a,  b],  then 

\   f{x)dx  <  I  f(x)dx.  (3.2.6) 

J  a  Ja 

Proof  Suppose  that  Pi  and  P-i  are  partitions  of  [a,  b]  and  P'  is  a  refinement  of  both. 
Letting  P  =  Pi  in  (3.2.3)  and  P  =  P2  in  (3.2.2)  shows  that 

s(Pi)  <  s{P')    and    S(P')  <  S(P2). 


Since  s(P')  <  S(P'),  this  implies  that  s(.Pi)  <  S(i32).  Thus,  every  lower  sum  is  a  lower 

bour 
that 


bound  for  the  set  of  all  upper  sums.  Since     f(x)  dx  is  the  infimum  of  this  set,  it  follows 


s(Pi)  <  f  f(x)dx 

J  a 

for  every  partition  Pi  of  [a,  b].  This  means  that  f  fix)  dx  is  an  upper  bound  for  the  set 
of  all  lower  sums.  Since     f(x)  dx  is  the  supremum  of  this  set,  this  implies  (3.2.6).  H 

Theorem  3.2.3  // /  is  integrable  on  [a,b],  then 

•  b  ~r~b  i-b 


no  1*0  ru 

I   f(x)dx=  I   f(x)dx=  I  f(x)dx. 

Ja  J  a  J  a 


Proof   We  prove  that     f(x)  dx  =  f  f  (x)  dx  and  leave  it  to  you  to  show  that  fa  f(x)  dx  = 
f(x)dx  (Exercise  3.2.2). 

Suppose  that  P  is  a  partition  of  [a,  b]  and  a  is  a  Riemann  sum  of  /  over  P .  Since 

pb  nb  I   i>b  \ 

/  f(x)dx-      f(x)dx={      f(x)dx-S(P)  \+(S(P)-a) 

J  a  J  a  \Ja  J 

rb  \ 

+  I  a  -  I    f(x)  dx  , 
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the  triangle  inequality  implies  that 


I 


f  (x)  dx  —  I    f  (x)  dx 

J  a 


{' 

J  a 


+ 


fix)  dx  -  S(P) 

b 


+  \S(P)-a\ 


J  a 


f(x)  dx 


(3.2.7) 


Now  suppose  that  e  >  0.  From  Definition  3.1.3,  there  is  a  partition  Pq  of  [a,  b]  such  that 


i>b  i>b 

/  f{x)  dx  <  S(P0)  <  /  fix)  dx  + 

Ja  Ja 


From  Definition  3.1.1,  there  is  a  S  >  0  such  that 

"b 


J  a 


f  ix)  dx 


€ 

<  - 

3 


(3.2.8) 


(3.2.9) 


if  ||  P  ||  <  S.  Now  suppose  that  \\P\\  <  S  and  P  is  a  refinement  of  ^o-  Since  S(P)  <  SiPo) 
by  Lemma  3.2.1,  (3.2.8)  implies  that 


f 

J  a 


fix)  dx  <  S(P)  <  /   fix)  dx  + 


f 

J  a 


SO 


L 


S(P)-  /  f(x)dx 


in  addition  to  (3.2.9).  Now  (3.2.7),  (3.2.9),  and  (3.2.10)  imply  that 


I 


b  pb 
f(x)dx—  I  f(x)dx 

J  a 


<  *+\S(P)-ct\ 


(3.2.10) 


(3.2.11) 


for  every  Riemann  sum  a  of  /  over  P.  Since  SiP)  is  the  supremum  of  these  Riemann 
sums  (Theorem  3.1.4),  we  may  choose  a  so  that 


Now  (3.2.1 1)  implies  that 


I 


\S(P)-a\  <  -. 


nb 

f  ix)  dx  —  I    f  ix)  dx 

J  a 


<  €. 


Since  e  is  an  arbitrary  positive  number,  it  follows  that 


pb  rb 

I  f(x)dx=  I  fix)dx. 

J  a  J  a 


H 
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Lemma  3.2.4  If  f  is  bounded  on  [a,  b]  and  e  >  0,  there  is  a  8  >  0  such  that 


I  f(x)dx<S(P)<      f(x)dx+e  (3.2.12) 

J  a  J  a 

and 

pb  pb 

I   f(x)dx>s{P)>  I  f(x)dx-e 

J  a  J  a 


if\\P\\<8. 


Proof  We  show  that  (3.2. 1 2)  holds  if  ||  P  \\  is  sufficiently  small,  and  leave  the  rest  of  the 
proof  to  you  (Exercise  3.2.3). 

The  first  inequality  in  (3.2.12)  follows  immediately  from  Definition  3.1.3.  To  establish 
the  second  inequality,  suppose  that  |/(jc)|  <  K  \fa  <  x  <  b.  From  Definition  3. 1.3,  there 
is  a  partition  Pq  =  {xo,  x\, . . . ,  xr+\)  of  [a,  b]  such  that 

S(P0)  <  /  f(x)dx  +  -.  (3.2.13) 

J  a  * 

If  P  is  any  partition  of  [a,b],  let  P'  be  constructed  from  the  partition  points  of  Pq  and  P . 
Then 

S(P')  <  S{P0),  (3.2.14) 

by  Lemma  3.2.1.  Since  P'  is  obtained  by  adding  at  most  r  points  to  P ,  Lemma  3.2.1 
implies  that 

S(P')  >  S(P)-2Kr\\P\\.  (3.2.15) 
Now  (3.2.13),  (3.2.14),  and  (3.2.15)  imply  that 

S(P)  <  S(P')  +2Kr\\P\\ 
<  S(i50)  +  2^r||/»|| 


\  /(J» 
J  a 


<     /    I  {x)dx  +€-+2Kr\\P\\, 


Therefore,  (3.2.12)  holds  if 

\\p\\<S=—.  TJ 

Theorem  3.2.5  If  f  is  bounded  on  [a,  b]  and 

\  f(x)dx=  /  f(x)dx  =  L,  (3.2.16) 

J  a  J  a 

then  f  is  integrable  on  [a,  b]  and 

f  f(x)dx  =  L.  (3.2.17) 

J  a 
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Proof   If  €  >  0,  there  is  a  S  >  0  such  that 

pb  pb 

j  f{x)dx-€  <s(P)  <  S(P)  <      f(x)dx  +  e  (3.2.18) 

J  a  J  a 

if  ||  P  ||  <  S  (Lemma  3.2.4).  If  a  is  a  Riemann  sum  of  /  over  P,  then 

s(P)  <  a  <  S(P), 

so  (3.2.16)  and  (3.2.18)  imply  that 

L  —  e  <  a  <  L  +  e 

if  ||P  ||  <  <5.  Now  Definition  3.1.1  implies  (3.2.17).  H 
Theorems  3.2.3  and  3.2.5  imply  the  following  theorem. 

Theorem  3.2.6  A  bounded  function  f  is  integrable  on  [a,b]  if  and  only  if 

rb  ~~rT 


I   f(x)dx=l  f(x)dx. 

J  a  J  a 


The  next  theorem  translates  this  into  a  test  that  can  be  conveniently  applied. 

Theorem  3.2.7  If  f  is  bounded  on  [a,b],  then  f  is  integrable  on  [a,  b]  if  and  only  if 
for  each  e  >  0  there  is  a  partition  P  of  [a,  b]  for  which 

S(P)-s(P)  <€.  (3.2.19) 

Proof  We  leave  it  to  you  (Exercise  3.2.4)  to  show  that  if  f  f  (x)  dx  exists,  then  (3.2.19) 
holds  for  ||  P  ||  sufficiently  small.  This  implies  that  the  stated  condition  is  necessary  for  in- 
tegrability.  To  show  that  it  is  sufficient,  we  observe  that  since 

pb  pb 

s(P)<  /   f(x)dx<  /  f(x)dx<S(P) 

J  a  J  a 

for  all  P,  (3.2.19)  implies  that 

r>b  pb 

0<  /  f(x)dx-  f(x)dx<e. 

Ja  J  a 


Since  e  can  be  any  positive  number,  this  implies  that 


pb  pb 

I   f(x)dx=  I  f(x)dx. 

Ja  Ja 


Therefore,  f  f  (x)  dx  exists,  by  Theorem  3.2.5.  H 
The  next  two  theorems  are  important  applications  of  Theorem  3.2.7. 
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Theorem  3.2.8  Iff  is  continuous  on  [a,  b],  then  f  is  integrable  on  [a,  b]. 

Proof  Let  P  =  {xq,  xi, . . .  ,Xn}  be  a  partition  of  [a,  b].  Since  /  is  continuous  on  [a,  b], 
there  are  points  Cj  and  c'-  in  [Xj-i,  Xj]  such  that 

f(cj)  =  Mj  =      sup  f{x) 

X  j  —  \ <X<X  j 

and 

f(c'j)  =  mj=       inf  /(x) 

X  j  —  \<X<X  j 

(Theorem  2.2.9).  Therefore, 

n 

S(P)  -  s(P)  =  J2  [f(cj)  ~  f(c'j)]  (xj  -  xj.,).  (3.2.20) 
j=i 

Since  /  is  uniformly  continuous  on  [a,  b]  (Theorem  2.2.12),  there  is  for  each  €  >  OaiS  >  0 
such  that 

1/00-/001  <  7^— 

b  —  a 

if  x  and  x'  are  in  [a,  b]  and \x  —  x'\  <5.If||.P||  <  S,  then  \cj  —  c'j  \  <  S  and,  from  (3.2.20), 

n 

S(P)  -  s(P)  <  ~  xj-i)  = 

b  —  a  *— ' 

.7=1 

Hence,  /  is  integrable  on  [a,  b],  by  Theorem  3.2.7.  H 

Theorem  3.2.9  Iff  is  monotonic  on[a,b],  then  f  is  integrable  on  [a ,  b]. 

Proof   Let  P  =  {xo,  Xi, xn)  be  a  partition  of  [a,  b].  Since  /  is  nondecreasing, 

f(Xj)  =  Mj=      sup  f(x) 

X  j  —  1 <X<X  j 

and 

f(xj-i)=mj=       inf  f(x), 

Xj  —  \<X<Xj 

Hence, 

n 

S(P)-s(P)  =  ^2(f(xj)-  fQcj-iMxj-Xj-i). 

7  =  1 

Since  0  <  Xj  —  Xj-i  <  ||P ||  and  / (xj)  —  f  (xj-i)  >  0, 

n 

S(P)-S(P)  <  ||p ||  £(/(*;)-  /(xy-0) 
./=1 

=  II^IK/W-ZCa)). 
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Therefore, 

S(P)-s(P)<e    if  \\P\\(f(b)-f(a))<e, 

so  /  is  integrable  on  [a,  b],  by  Theorem  3.2.7. 

The  proof  for  nonincreasing  /  is  similar.  H 

We  will  also  use  Theorem  3.2.7  in  the  next  section  to  establish  properties  of  the  integral. 
In  Section  3.5  we  will  study  more  general  conditions  for  integrability. 

3.2  Exercises 


1.  Complete  the  proof  of  Lemma  3.2.1  by  verifying  Eqn.  (3.2.3). 

2.  Show  that  if  /  is  integrable  on  [a,b],  then 

-b  rb 


I   f(x)dx=  J  f(x)dx. 

J  a  J  a 


3.    Prove:  If  /  is  bounded  on  [a,b],  there  is  for  each  e  >  0  a  8  >  0  such  that 

rb  rb 


i  f{x)dx>  I  f(x)dx-e  <s(P) 

J  a  J  a 


if  II P II  <  S. 

4.  Prove:  If  /  is  integrable  on  [a,b]  and  e  >  0,  then  S(P)  —  s(P)  <  e  if  \\P\\  is 
sufficiently  small.  Hint:  Use  Theorem  3.1.4. 

5.  Suppose  that  /  is  integrable  and  g  is  bounded  on  [a,  b],  and  g  differs  from  /  only 
at  points  in  a  set  H  with  the  following  property:  For  each  e  >  0,  H  can  be  covered 
by  a  finite  number  of  closed  subintervals  of  [a,  b],  the  sum  of  whose  lengths  is  less 
than  e.  Show  that  g  is  integrable  on  [a,  b]  and  that 

rb  rb 


po  no 

I    g(x)dx=  J  f(x)dx. 

Ja  Ja 


Hint:  Use  Exercise  3.1.3. 

Suppose  that  g  is  bounded  on  [a,  ff\,  and  let  Q  :  a  =  vq  <  v\  <  ■  ■  ■  <  vl  =  P  be 
a  fixed  partition  of  [a,  ff\.  Prove: 


L    „.,„  „r  L 


rP  _     rvi  rp  _  rvi 

(a)     /   g{u)du  =       /     g(u)du;     (b)     /   g(u)du  =       /  g(u)du. 

7.    A  function  /  is  of  bounded  variation  on  [a,  b]  if  there  is  a  number  K  such  that 

n 

J2\f(aj)-  f(aj-i)\<K 

.7=1 

whenever  a  =  ao  <  a\  <  ■■■  <  a„  =  b.  (The  smallest  number  with  this  property 
is  the  total  variation  of  f  on  [a,  b\.) 
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(a)  Prove:  If  /  is  of  bounded  variation  on  [a,b],  then  /  is  bounded  on  [a,  b\. 

(b)  Prove:  If  /  is  of  bounded  variation  on  [a,b],  then  /  is  integrable  on  [a,  b\. 
Hint:  Use  Theorems  3.1 .4  and  3.2.7. 

8.    Let  P  =  {xq,  xi,  . . . ,  xn}  be  a  partition  of  [a,  b],  cq  =  xq  =  a,  cn+\  =  x„  =  b, 
and  Xj-i  <  Cj  <  Xj,  j  =  1,  2, . . . ,  n.  Verify  that 


J>(c/)[/(*/)-/(*y-i)]  =  g(6)/(6)-g(a)/(a)-E  f(xj)[g(cJ+1)-g(cj)]. 


(This  is  the  integration  by  parts  formula  for  Riemann-Stieltjes  integrals.) 

9.    Let  /  be  continuous  and  g  be  of  bounded  variation  (Exercise  3.2.7)  on  [a,  b], 

(a)  Show  that  if  e  >  0,  there  is  a  8  >  0  such  that  \a  —  a'\  <  e/2  if  o  and  tr' 
are  Riemann-Stieltjes  sums  of  /  with  respect  to  g  over  partitions  P  and  P' 
of  [a,&],  where  P'  is  a  refinement  of  P  and  <  S.  Hint:  Use  Theo- 
rem 2.2.12. 

(b)  Let  S  be  as  chosen  in  (a).  Suppose  that  0\  and  a-i  are  Riemann-Stieltjes 
sums  of  /  with  respect  to  g  over  any  partitions  Pi  and  Pi  of  [a,  b]  with  norm 
less  than  S.  Show  that  \a\  —  CT2I  <  £■ 

(c)  If  5  >  0,  let  L(<5)  be  the  supremum  of  all  Riemann-Stieltjes  sums  of  /  with 
respect  to  g  over  partitions  of  [a,  b]  with  norms  less  than  S.  Show  that  L(S)  is 
finite.  Then  show  that  L  =  lim^^o+  L(fi)  exists.  Hint:  Use  Theorem  2.1.9. 

(d)  Show  that      f(x)  dg{x)  =  L. 

10.     Show  that      f  (x)  dg(x)  exists  if  /  is  of  bounded  variation  and  g  is  continuous  on 
[a,  b].  Hint:  See  Exercises  3.2.8  and  3.2.9. 

3.3  PROPERTIES  OF  THE  INTEGRAL 

We  now  use  the  results  of  Sections  3.1  and  3.2  to  establish  the  properties  of  the  integral. 
You  are  probably  familiar  with  most  of  these  properties,  but  not  with  their  proofs. 

Theorem  3.3.1  // /  and  g  are  integrable  on  [a,  b],  then  so  is  f  +  g,  and 


n 


n 


Use  this  to  prove  that  if  f  f  (x)  dg(x)  exists,  then  so  does  f  g(x)  df(x),  and 


/ (x)  dx  + 
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Proof  Any  Riemann  sum  of  /  +  g  over  a  partition  P  =  {xq,  x\,  . . . ,  xn}  of  [a,  b]  can 
be  written  as 


°f+g  =  J2  WW  +  g(Pj)K*J  ~  */-0 


j=i 

n  n 

=  J2  f(cNxJ  -  xJ-o  +    g(cj)(xJ  -  xJ-i) 
j=i  j=i 

=  Of+Og, 

where  a/  and  og  are  Riemann  sums  for  /  and  g.  Definition  3.1.1  implies  that  if  e  >  0 
there  are  positive  numbers  S\  and  82  such  that 


Of 


J  a 


x)  dx 


and 


0jr  -  /  g(x)  dx 

J  a 

If  ||P  ||  <  8  =  mm(8i,82),  then 


<-    if  ||P|l<3i 
2         "  " 


<  -    if  \\P\\<82. 
2         "    "  1 


fib  fib  /  />/?  \  /  />/?  ^ 

°f+g-  \    f(x)dx-  I   g(x)dx  =  [of-j    f(x)dx\  +  log-  g(x)dx 

Ja  Ja  \  J  a  J        \  J  a  < 

pb  pb 

Of—I    f (x)  dx  +  ag  —  I  g(x)dx 

Ja  Ja 


e  e 
<    2  +  2=€- 
so  the  conclusion  follows  from  Definition  3.1.1. 

The  next  theorem  also  follows  from  Definition  3.1.1  (Exercise  3.3.1). 


TJ 


Theorem  3.3.2  If  f  is  integrable  on  [a,  b]  and  c  is  a  constant,  then  c f  is  integrable 
on  [a,  b]  and 


I    cf(x)dx  =  c  I  f(x)dx. 

J  a  J  a 


Theorems  3.3.1  and  3.3.2  and  induction  yield  the  following  result  (Exercise  3.3.2). 

Theorem  3.3.3  If  f\,  f2,  /„  are  integrable  on  [a,b]  and  c\,  c2,  cn  are 
constants,  then  C\f\  +  c2f2  H  +  c„f„  is  integrable  on  [a,  b]  and 

pb  i>b  pb 

/   (ci/i +C2/2  H  \-cnfn)(x)dx  =  c\  /    f\(x)dx+c2  I  fi(x)dx 

J  a  J  a  J  a 


+  ---  +  C 


if 

n  J  Jn 

J  a 


(x)  dx. 


Section  3.3  Properties  of  the  Integral  137 


Theorem  3.3.4  /  and  g  are  integrable  on  [a,  b]  and  f(x)  <  g(x)  for  a  <  x  <  b, 
then 


\    f(x)dx  <  /    g(x)dx.  (3.3.1) 

J  a  Ja 

Proof  Since  g(x)  —  f  (x)  >  0,  every  lower  sum  of  g  —  f  over  any  partition  of  [a,  b]  is 
nonnegative.  Therefore, 

(g(x)-f(x))dx>0. 


f 

J  a 


Hence, 

r-6  rb  rb 


ru  pu  pu 

I    g(x)  dx-       f{x)  dx  =      igix)  -  fix))  dx 

J  a  J  a  J  a 


L 


b  (3.3.2) 
(g(x)-f(x))dx>0, 


which  yields  (3.3.1).  (The  first  equality  in  (3.3.2)  follows  from  Theorems  3.3.1  and  3.3.2; 
the  second,  from  Theorem  3.2.3.)  H 

Theorem  3.3.5  Iff  is  integrable  on  [a,b],  then  so  is  \f\,  and 

I  f{x)dx  <  f  \f(x)\dx.  (3.3.3) 

J  a  J  a 

Proof   Let  P  be  a  partition  of  [a,  b]  and  define 

Mj  =  sup  { / ix)  |  Xj-i  <  x  <  Xj]  , 
irij  =  inf  { / (x)  |  Xj-\  <  x  <  Xj}  , 
Mj  =  sup{|/(x)|  |  xj-i  <x<  xj}  , 
rhj  =  inf  { |  / (x)  |  |  Xj-\  <  x  <  Xj }  . 


Then 


Therefore, 


Mj  —mj  =  sup  —  |/(x')|  |  Xj-i  <  x,x'  <  Xj} 

<  sup { I /(x) -/(*') I  \xj-i  <x,x'<  Xj}  (3.3.4) 
=  Mj  —  m  j . 

~SiP)-siP)  <  S(P)-s(P), 


where  the  upper  and  lower  sums  on  the  left  are  associated  with  |  / 1  and  those  on  the  right  are 
associated  with  /.  Now  suppose  that  e  >  0.  Since  /  is  integrable  on  [a,  b],  Theorem  3.2.7 
implies  that  there  is  a  partition  P  of  [a,  b]  such  that  SiP)  —  s{P)  <  e.  This  inequality 
and  (3.3.4)  imply  that  SiP)  —  ~siP)  <  e.  Therefore,  \  f\  is  integrable  on  [a,  b],  again  by 
Theorem  3.2.7. 

Since 

/(*)<[/(*)!    and     -/(jc)<|/(jc)|,  a<x<b, 
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Theorems  3.3.2  and  3.3.4  imply  that 

pb  pb  pb  pb 

\    f(x)dx<  I    \f(x)\dx    and  -  /    f(x)dx<  \f(x)\dx, 

J  a  J  a  J  a  J  a 

which  implies  (3.3.3).  H 

Theorem  3.3.6  If  f  and  g  are  integrable  on  [a,  b],  then  so  is  the  product  fg. 

Proof  We  consider  the  case  where  /  and  g  are  nonnegative,  and  leave  the  rest  of  the 
proof  to  you  (Exercise  3.3.4).  The  subscripts  /,  g,  and  fg  in  the  following  argument 
identify  the  functions  with  which  the  various  quantities  are  associated.  We  assume  that 
neither  /  nor  g  is  identically  zero  on  [a,  b],  since  the  conclusion  is  obvious  if  one  of  them 
is. 

If  P  =  {xq,  Xi,...,  xn}  is  a  partition  of  [a,  b],  then 

n 

Sfg(P)  -  sfg(P)  =  J2^MfsJ  -  mfsMxJ  -  (3-3-5) 

Since  /  and  g  are  nonnegative,  Mfgj  <  M/jMgj  and  m fgj  >  m fjmgj.  Hence, 

MfgJ-mfgJ  <  MfjMgj  -mfjmgj 

=  (MfJ  ~  mfj)Mg,i  +  mfj(MgJ  ~  mg,i) 
<  Mg(MfJ  -  mfJ)  +  Mf(Mgj  -  mgJ), 

where  Mf  and  Mg  are  upper  bounds  for  /  and  g  on  [a,b\.  From  (3.3.5)  and  the  last 
inequality, 

Sfg(P)-sfg(P)  <  Mg[Sf(P)-sf(P)]  +  Mf[Sg(P)  -  sg(P)].  (3.3.6) 

Now  suppose  that  e  >  0.  Theorem  3.2.7  implies  that  there  are  partitions  P\  and  P-i  of 
[a ,  b]  such  that 

Sf(P1)-sf(P1)<^-    and    Sg(P2)-sg(P2)  <  (3.3.7) 

If  P  is  a  refinement  of  both  P\  and  Pi,  then  (3.3.7)  and  Lemma  3.2.1  imply  that 

Sf(P)sf(P)<^-    and  Sg(P)-sg(P)<^. 
This  and  (3.3.6)  yield 

Sfg(P)-Sfg(P)<e-+€-=€. 

Therefore,  fg  is  integrable  on  [a,  b],  by  Theorem  3.2.7.  H 
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Theorem  3.3.7  (First  Mean  Value  Theorem  for  Integrals)  Suppose  that 
u  is  continuous  and  v  is  integrable  and  nonnegative  on  [a,  b].  Then 

b  rb 
u(x)v(x)  dx  =  u(c)  I    v(x)dx  (3.3.8) 

J  a 

for  some  c  in  [a,  b]. 

Proof   From  Theorem  3.2.8,  u  is  integrable  on  [a,  b].  Therefore,  Theorem  3.3.6  implies 
that  the  integral  on  the  left  exists.  If  m  =  min  \u(x)  |  a  <  x  <  b}  and  M  =  max  {w(x)  |  a  <  x 
(recall  Theorem  2.2.9),  then 

m  <  u(x)  <  M 

and,  since  v{x)  >  0, 

mv(x)  <  u(x)v(x)  <  Mv(x). 
Therefore,  Theorems  3.3.2  and  3.3.4  imply  that 

pb  fb  pb 

m  I    v(x)dx<  I    u(x)v(x)  dx  <  M  I    v(x)dx.  (3.3.9) 

J  a  J  a  J  a 

This  implies  that  (3.3.8)  holds  for  any  c  in  [a,  b]  if  f  v(x)  dx  =  0.  If     v(x)  dx  ^  0, 

let 


(3.3.10) 


Since  /  v(x)dx  >  0  in  this  case  (why?),  (3.3.9)  implies  that  m  <  u  <  M,  and  the 
intermediate  value  theorem  (Theorem  2.2.10)  implies  that  u  =  u(c)  for  some  c  in  [a,  b]. 
This  implies  (3.3.8).  U 

If  v(x)  =  1,  then  (3.3.10)  reduces  to 


so  u  is  the  average  of  u(x)  over  [a,b].  More  generally,  if  v  is  any  nonnegative  integrable 
function  such  that  f  v(x)  dx  ^  0,  then  u  in  (3.3. 10)  is  the  weighted  average  ofu(x)  over 
[a,  b]  with  respect  to  v.  Theorem  3.3.7  says  that  a  continuous  function  assumes  any  such 
weighted  average  at  some  point  in  [a,  b]. 


Theorem  3.3.8  If  f  is  integrable  on  [a,  b]  and  a  <  a\  <  b\  <  b,  then  f  is  integrable 
on  [ai,  b\]. 
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Proof  Suppose  thate  >  0.  From  Theorem  3.2.7,  there  is  a  partition  P  =  {xo,  X\, . . . ,  xn] 
of  [a,  b]  such  that 

n 

S(P)-s(P)  =  J2(MJ -mjXxj-Xj-!)  <e.  (3.3.11) 

7=1 

We  may  assume  that  a  \  and  b\  are  partition  points  of  P ,  because  if  not  they  can  be  inserted 
to  obtain  a  refinement  P'  such  that  S(P')  -  s(P')  <  S(P)  -  s(P)  (Lemma  3.2.1).  Let 
a\  =  xr  and  b\  =  xs.  Since  every  term  in  (3.3.1 1)  is  nonnegative, 

^2  (Mj  -mj)(xj  -xj-i)  <  e. 

j=r+ 1 

Thus,  P  =  {xr,  xr+\, . . . ,  xs}  is  a  partition  of  [a\,  b\\  over  which  the  upper  and  lower 
sums  of  /  satisfy 

S(P)-s(P)  <€. 

Therefore,  /  is  integrable  on  [a\,  b\\,  by  Theorem  3.2.7.  TJ 
We  leave  the  proof  of  the  next  theorem  to  you  (Exercise  3.3.8). 

Theorem  3.3.9  If  f  is  integrable  on  [a,  b]  and  [b,  c],  then  f  is  integrable  on  [a,  c], 
and 


(3.3.12) 


So  far  we  have  defined  ff  / (x)  dx  only  for  the  case  where  a  <  f>.  Now  we  define 


if  a  <  /3,  and 

f   f{x)  dx  =  0. 

With  these  conventions,  (3.3.12)  holds  no  matter  what  the  relative  order  of  a,  b,  and  c, 
provided  that  /  is  integrable  on  some  closed  interval  containing  them  (Exercise  3.3.9). 

Theorem  3.3.8  and  these  definitions  enable  us  to  define  a  function  F(x)  =  f*  f  (t)  dt, 
where  c  is  an  arbitrary,  but  fixed,  point  in  [a.b]. 

Theorem  3.3.10  If  f  is  integrable  on  [a,b]  and  a  <  c  <  b,  then  the  function  F 
defined  by 

F(x)  =  j*  f(t)dt 
satisfies  a  Lipschitz  condition  on  [a,b\,  and  is  therefore  continuous  on  \a,b\. 
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Proof   If  x  and  x'  are  in  [a,  b\,  then 

f(x)-f(x')=  r  f(t)dt~  r  f{t)dt=  f  f{t)dt, 

J  c  J  c  J  X 

by  Theorem  3.3.9  and  the  conventions  just  adopted.  Since  \f(t)\  <  K  (a  <  t  <  b)  for 
some  constant  K, 


i/; 


f(t)dt 


<  K\x  -  x'\,    a  <  x,  x'  < 


(Theorem  3.3.5),  so 


\F(x)  -  F(x')\  <  K\x-x'\,  a<x,x'<. 


TJ 


Theorem  3.3.11  // /  is  integrable  on  [a,  b]  anda  <  c  <  b,  then  F{x)  =  f*  f(t)  dt 
is  differentiable  at  any  point  Xq  in  (a,  b)  where  f  is  continuous,  with  F'(xq)  =  f(xo)-  If 
f  is  continuous  from  the  right  at  a,  then  F'+(a)  =  f(a).  Iff  is  continuous  from  the  left 
atb,thenF!_(b)  =  f(b). 

Proof  We  consider  the  case  where  a  <  xq  <  b  and  leave  the  rest  to  you  (Exer- 
cise 3.3.14).  Since 

1  fx 
  /    f(x0)  dt  =  f{x0), 

X  —  Xq  Jx0 


we  can  write 

F(x)  -  F(x0) 

X  —  Xq 

From  this  and  Theorem  3.3.5, 
F(x)  -  F(xQ) 


-  f(xQ)  =  f[fit)  -  /(*„)]  dt. 

X  —  Xq  Jx„ 


X  —  Xq 


-  f(x0) 


—\f\f{t)-f(xo)\dt 

-*0|  \Jx0 


(3.3.13) 


(Why  do  we  need  the  absolute  value  bars  outside  the  integral?)  Since  /  is  continuous  at 
xq,  there  is  for  each  e  >  0  a  S  >  0  such  that 

|/(0-/(^o)|  <e    if  \x-x0\<S 
and  t  is  between  x  and  xq.  Therefore,  from  (3.3.13), 


F(x)  -  F(x0) 


X  —  Xq 

Hence,  F'{xq)  =  f(xo). 


-  f(xo) 


<€-  -  =  €     if  0<\X-Xq\<6. 

\X  -Xq\ 


H 
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I  x,  0  <  x  <  1, 
I  x  +  1,    1  <  x  <  2, 


/(*)  = 
then  the  function 

(x2 

0  <  x  <  1, 


n*)= r  f(t)dt= i 2. 


2 

is  continuous  on  [0,  2].  As  implied  by  Theorem  3.3.1 1, 
fx  =  f{x),  0  <  x  <  1, 


2 

+  x  -  1,    1  <  x  <  2, 


[x  +  1  =  /(x),    1  <  x  <  2, 


F(x)-F(0)  (x2/2)-0 
Fi(0)  =   lim  — — '  —  =   lim  V    1  '        =  0  =  /(0), 

T  x-^0+  X  x^-0+  X 

F'(2)=  Urn  ^a  lim  ^  +  *  =  1  =  3 

x^2-         X  —2  x^2-  X  —  2 

=  lim  =  3  =  /(2). 

x^2-  2 

F  does  not  have  a  derivative  at  x  =  1,  where  /  is  discontinuous,  since 

FL(l)  =  1    and    F|(l)  =  2.  ■ 

The  next  theorem  relates  integration  and  differentiation  in  another  way. 

Theorem  3.3.12  Suppose  that  F  is  continuous  on  the  closed  interval  [a,  b]  and  dif- 
ferentiable  on  the  open  interval  (a,b),  and  f  is  integrable  on  [a,  b\.  Suppose  also  that 

F'(x)  =  fix),    a  <  x  <  b. 

Then 


L 


b 

f(x)  dx  =  F{b)  -  F(a).  (3.3.14) 


Proof   If  P  =  {x0  ,  Xi,  — ,  xn}  is  a  partition  of  [a,  b]9  then 

n 

F(b)  -  F(a)  =  £(F(x,)  -  F(xj-i)).  (3.3.15) 

.7=1 

From  Theorem  2.3.1 1,  there  is  in  each  open  interval  (xj-i,Xj)  a  point  Cj  such  that 
F(xj)  -  F{Xj-i)  =  f(cj)(xj  -xj-i). 
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Hence,  (3.3.15)  can  be  written  as 


F(b)  -  F(o)  =       f(cj)(xj  -         =  a, 

./  =  ! 

where  a  is  a  Riemann  sum  for  /  over  P .  Since  /  is  integrable  on  [a,b\,  there  is  for  each 
€  >  0  a  S  >  0  such  that 


J  a 


f  (x)  dx 


<e    if  \\P\\<8. 


Therefore, 


F(b)  -  F(a)  -  f  f( 

J  a 

for  every  e  >  0,  which  implies  (3.3.14). 


x  )  dx 


<  e 


U 


Corollary  3.3.13  If  f  is  integrable  on  [a,  b],  then 

b 

f'(x)dx  =  f(b)-f(a). 

Proof   Apply  Theorem  3.3.12  with  F  and  /  replaced  by  /  and  /',  respectively.  H 

A  function  F  is  an  antiderivative  of  /  on  [a,  b]  if  F  is  continuous  on  [a,  b]  and  differ- 
entiable  on  (a,  b),  with 

F'(x)  =  f(x),    a  <  x  <  b. 

If  F  is  an  antiderivative  of  /  on  [a,  b],  then  so  is  F  +  c  for  any  constant  c.  Conversely, 
if  F\  and  F-i  are  antiderivatives  of  /  on  [a,  b],  then  Fi  —  F-i  is  constant  on  [a,  b]  (Theo- 
rem 2.3.12).  Theorem  3.3.12  shows  that  antiderivatives  can  be  used  to  evaluate  integrals. 

Theorem  3.3.14  (Fundamental  Theorem  of  Calculus)  If  f  is  continu- 
ous on  \a,b\,  then  f  has  an  antiderivative  on  \a,b\.  Moreover,  if  F  is  any  antiderivative 
of  f  on[a,b],  then 

b 

f(x)  dx  =  F(b)  -  F(a). 

Proof  The  function  Fq(x)  =  f  f(t)dt  is  continuous  on  [a,b]  by  Theorem  3.3.10, 
and  Fq(x)  =  f(x)  on  (a,  b)  by  Theorem  3.3.11.  Therefore,  Fq  is  an  antiderivative  of  / 
on  [a,b].  Now  let  F  =  Fq  +  c  (c  =  constant)  be  an  arbitrary  antiderivative  of  /  on  [a,  b\. 
Then 


L 


pb  pa  pb 

F(b)-F(a)=       f(x)dx  +  c-      f(x)dx-c=       f(x)dx.  TJ 

J  a  J  a  J  a 
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When  applying  this  theorem,  we  will  use  the  familiar  notation 

b 

F(b)  -  F{a)  =  F(x)  . 

a 

Theorem  3.3.15  (Integration  by  Parts)  Ifu'andv'areintegrableon[a,b], 
then 

pb  \b  »b 

I    u(x)v'(x)  dx  =  u(x)v(x)\   —  J    v(x)u'(x)dx.  (3.3.16) 

J  a  \a      J  a 

Proof  Since  u  and  v  are  continuous  on  [a,  b]  (Theorem  2.3.3),  they  are  integrable  on 
[a,  b\.  Therefore,  Theorems  3.3.1  and  3.3.6  imply  that  the  function 

(uv)'  =  u'v  +  uv' 

is  integrable  on  [a,b],  and  Theorem  3.3.12  implies  that 

»6 


/ 

J  a 


b 

u(x)v'(x)  +  u'(x)v(x)]  dx  =  u(x)v(x)  , 


which  implies  (3.3.16).  TJ 
We  will  use  Theorem  3.3.15  here  and  in  the  next  section  to  obtain  other  results. 

Theorem  3.3.16  (Second  Mean  Value  Theorem  for  Integrals)  Suppose 
that  f  is  nonnegative  and  integrable  and  g  is  continuous  on  [a,  b].  Then 

f  f(x)g(x)  dx  =  /(c)  f  g(x)  dx  +  f(b)  f  g(x)  dx  (3.3.17) 

J  a  J  a  Jc 

for  some  c  in  [a,  b]. 

Proof  Since  /  is  differentiable  on  [a,b],  it  is  continuous  on  [a,b]  (Theorem  2.3.3). 
Since  g  is  continuous  on  [a,  b],  so  is  fg  (Theorem  2.2.5).  Therefore,  Theorem  3.2.8  implies 
that  the  integrals  in  (3.3.17)  exist.  If 

G(x)  =  f  g(t)dt,  (3.3.18) 

J  a 

then  G'(x)  =  g(x),  a  <  x  <  b  (Theorem  3.3.1 1).  Therefore,  Theorem  3.3.15  with  u  =  f 
and  v  =  G  yields 

f  f{x)g{x)dx  =  f(x)G(x)\   -  f  f'(x)G(x)dx.  (3.3.19) 
Since  /'  is  nonnegative  and  G  is  continuous,  Theorem  3.3.7  implies  that 

pb  r>b 

\    f'(x)G(x)dx  =  G(c)      f'(x)dx  (3.3.20) 

J  a  J  a 
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for  some  c  in  [a,  b].  From  Corollary  3.3.12, 

fb 


J  a 


(x)dx  =  f(b)-  f(a). 


From  this  and  (3.3.18),  (3.3.20)  can  be  rewritten  as 

-b 


H 


[  f'(x)G(x)  dx  =  (f(b)  -  f(a))  f  g(x)  dx. 

J  a  J  a 

Substituting  this  into  (3.3.19)  and  noting  that  G(a)  =  0  yields 

f  f(x)g(x)  dx  =  f(b)  f  g(x)  dx  -  (f(b)  -  /(a))  f  g(x)  dx, 

J  a  J  a  J  a 

pc  I  pb  pa  ^ 

=  f(a)  I    g(x)  dx  +  f(b)     /    g(x)  dx  -       g(x)  dx 

J a  \J a  J c  j 

=  /(«)/    g(x)dx  +  f(b)  g(x)dx. 

J  a  Jc 

Change  of  Variable 

The  following  theorem  on  change  of  variable  is  useful  for  evaluating  integrals. 

Theorem  3.3.17  Suppose  that  the  transformation  x  =  (p(t)  maps  the  interval  c  < 
t  <  d  into  the  interval  a  <  x  <  b,  with  <p{c)  =  a  and  <p{d)  =  j3,  and  let  f  be  continuous 
on  [a,b\.  Let  <p'  be  integrable  on  [c,d\.  Then 

\    f(x)dx=  /    f(4,(t))<p'(t)dt.  (3.3.21) 

J  a  Jc 

Proof  Both  integrals  in  (3.3.21)  exist:  the  one  on  the  left  by  Theorem  3.2.8,  the  one  on 
the  right  by  Theorems  3.2.8  and  3.3.6  and  the  continuity  of  / ((p(t)).  By  Theorem  3.3.1 1, 
the  function 

Hx)  =  f f(y)dy 


is  an  antiderivative  of  /  on  [a,  b]  and,  therefore,  also  on  the  closed  interval  with  endpoints 
a  and  f).  Hence,  by  Theorem  3.3.14, 


/' 

J  a 


f(x)  dx  =  F(P)  -  F(a).  (3.3.22) 


By  the  chain  rule,  the  function 

G(t)  =  F(<p(t)) 
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is  an  antiderivative  of  / (<p(t))(p' (t)  on  [c,  d],  and  Theorem  3.3.12  implies  that 

jd  fmW(t)  dt  =  G(d)  -  G(c)  =  F(4>(d))  -  F(c/>(c)) 
=  F(P)  -  F(«). 

Comparing  this  with  (3.3.22)  yields  (3.3.21).  TJ 

Example  3.3.2  To  evaluate  the  integral 

/■1/V2 

/  =  /        (1  -  2jc2)(1  -  x2)~1/2dx 
J-1/V2 

we  let 

f(x)  =  (1-2jc2)(1  -x2y1/2,     -l/V2<x  <  1/V2, 

and 

x  =  (p(t)  =  sin/,    -7T/4  <  t  <  it/4. 

Then  </>'(/)  =  cos  /  and 

•  I/V2  i-n/4 


f(x)dx=  I       /(sin/)  cos  f  dt 

/tt/4 
(1-2  sin2  f  )(1  -  sin2  t )"1/2  cos  /  d/. 
-tt/4 


t/4 

(1  -  sin2  t )1/2  =  cos  t ,  -7r/4  <  /  <  7r/4 


and 


(3.3.23)  yields 


1  -2  sin2  /  =  cos  2/, 


/JT, 
-It 


71 IA  ,       sin  2/ 

cos  2/  flf  =   

/4  2 


tt/4 

=  1. 

-tt/4 


Example  3.3.3  To  evaluate  the  integral 

i>5n 


sin  t 

dt, 


2  +  cos  / 

we  take  </>(/)  =  cos  t .  Then  (p'(t)  =  —  sin/  and 

I  = 


riK  <t>'(t)  r* 

/  TT^dt  =  -\  fmW(t)dt, 
Jo    2  +  </>(/)  y0 


where 

/w  =  1 


(3.3.23) 


2  +  x 


Section  3.3  Properties  of  the  Integral  147 


Therefore,  since  0(0)  =  1  and  <P{5tt)  =  —  1, 


r 


dx  ' 


2  +  x 


log(2  +  x) 


=  log3. 
l 


These  examples  illustrate  two  ways  to  use  Theorem  3.3.17.  In  Example  3.3.2  we  evalu- 
ated the  left  side  of  (3.3.21)  by  transforming  it  to  the  right  side  with  a  suitable  substitution 
x  =  (p(t),  while  in  Example  3.3.3  we  evaluated  the  right  side  of  (3.3.21)  by  recognizing 
that  it  could  be  obtained  from  the  left  side  by  a  suitable  substitution. 

The  following  theorem  shows  that  the  rule  for  change  of  variable  remains  valid  under 
weaker  assumptions  on  /  if  <p  is  monotonic. 

Theorem  3.3.18  Suppose  that  <p'  is  integrable  and  <p  is  monotonic  on  [c,  d],  and  the 
transformation  x  =  (p(t)  maps  [c,  d]  onto  [a,b\.  Let  f  be  bounded  on  [a,  b].  Then 

g(t)  =  /mw(t) 

is  integrable  on  [c,  d]  if  and  only  if  f  is  integrable  over  [a,b],  and  in  this  case 

-b  rd 


f  f(X)dx=  f  fmwmdt. 

J  a  J  c 


Proof  We  consider  the  case  where  /  is  nonnegative  and  (p  is  nondecreasing,  and  leave 
the  the  rest  of  the  proof  to  you  (Exercises  3.3.20  and  3.3.21). 

First  assume  that  (p  is  increasing.  We  show  first  that 

I   f(x)dx=  /    f(4>(t))4>'(t)dt.  (3.3.24) 

J  a  J  c 

Let  P  =  {to,  h, . . . ,  tn}  be  a  partition  of  [c.  d]  and  P  =  {xo,  x\, . . . ,  xn}  with  Xj  =  (pit/) 
be  the  corresponding  partition  of  [a,b].  Define 


=  sup 

{4>'(t) 

tj-i  St  Stj}, 

Uj 

=  inf 

[<P\t)  | 

tj-i  St  Stj}, 

Mj 

=  sup 

{/(*) 

|  Xj-i  S  x  S  Xj  } , 

and 

Mj  =  sup  {f(cf>(t))cf>'(t)  |  tj-i  St  Stj}. 
Since  <p  is  increasing,  Uj  >  0.  Therefore, 

0  <  Uj  <  <p'(t)  s  Uj,    tj-i  St  Stj. 
Since  /  is  nonnegative,  this  implies  that 

o  <  f(4>(t))uj  s  fmw(t)  s  fmwj,  tj^ 

Therefore, 

Mj  Uj  S  M '  j  S  MjUj, 
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which  implies  that 
where 

Now  consider  the  upper  sums 


MJ  =  MJPj> 
uj  <  pj  <  Uj. 


S(P)  =  £  Mj(tj  -  tj-i)    and    S(P)  =  £  MjQcj  -  xy-i). 

.7=1  .7=1 

From  the  mean  value  theorem, 

xj  -Xj-i  =  4>(fj)  ~  0(0-1)  =  0'(ry)(O  _  0-i). 
where  fy_ i  <  ry  <  tj,  so 


Uj<4>'<rj)<Uj. 


From  (3.3.25),  (3.3.27),  and  (3.3.28 


SOP)  -  S(P)  =      Mjipj  -  4>'{xj))(tj  -  tj-,). 

7=1 


(3.3.25) 
(3.3.26) 

(3.3.27) 

(3.3.28) 
(3.3.29) 

(3.3.30) 


Now  suppose  that  \f(x)\  <  M,a  <  x  <  b.  Then  (3.3.26),  (3.3.29),  and  (3.3.30)  imply 
that 

n 

\S(P)  -  S(P)\  <  M  J2(Uj  ~  ujKtj  -  tj-!). 

7=1 

The  sum  on  the  right  is  the  difference  between  the  upper  and  lower  sums  of  <p'  over  P . 
Since  (p'  is  integrable  on  [c,  d],  this  can  be  made  as  small  as  we  please  by  choosing  ||  P  || 
sufficiently  small  (Exercise  3.2.4). 

From  (3.3.28),  H^H  <  K\\T\\  if  \</>'(t)\  <  K,c  <  t  <  d.  Hence,  Lemma  3.2.4  implies 
that 


S(P) 


J  a 


x)  dx 


<  -  and 

3 


s(P)-  J  fmw(f)dt 


<  -  (3.3.31) 


if  ||  P  ||  is  sufficiently  small.  Now 


[  f(x)dx-  f  f(<p(t))<P'(t)dt  <  [  f(x)dx-S(P) 

J  a  Jc  J  a 


+  \S(P)-S(P)\ 


+ 


s(p)-  fe  fmw(t)dt 


Choosing  P  so  that  \S(P)  -  S(P\  <  e/3  in  addition  to  (3.3.31)  yields 


f  f(x)dx-  f  f{<t>{t))<t>'{t)dt 

J  a  Jc 

Since  e  is  an  arbitrary  positive  number,  this  implies  (3.3.24). 


<  e. 
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If  (p  is  nondecreasing  (rather  than  increasing),  it  may  happen  that  Xj-\  =  Xj  for  some 
values  of  j ;  however,  this  is  no  real  complication,  since  it  simply  means  that  some  terms  in 
S(P)  vanish. 

By  applying  (3.3.24)  to  —  /,  we  infer  that 

f  f(x)dx=  f  fmW(t)dt,  (3.3.32) 

J  a  Jc 


since 


and 


\  (-/)(*)  dx  =  -     f{x)  dx 

J  a  J  a 

j\-f(Ht)<P'(t))dt  =  -Jd f(<p(t))<p'(t)dt. 


Now  suppose  that  /  is  integrable  on  [a,  b].  Then 

eb  ~Tb  rb 


ft)  fO  f-C 

\  fix)  dx  =      fix)  dx  = 

Ja  Ja  Ja 


f  ix)  dx, 


by  Theorem  3.2.3.  From  this,  (3.3.24),  and  (3.3.32), 

f  f(4>(t))4>'(t)dt  =  f  f(<p(tW(t)dt  =  [b  f(x)dx. 

J  c  Jc  Ja 

This  and  Theorem  3.2.5  (applied  to  / {cp(t))<p'  (t))  imply  that  / (cp(t))<p'  (t)  is  integrable  on 
[c,  d]  and 

f  f(x)dx=  [   f(<t>(t))<f>'(t)dt.  (3.3.33) 

J  a  Jc 

A  similar  argument  shows  that  if  / (<p(t))<p'  (t)  is  integrable  on  [c,  d],  then  /  is  integrable 
on  [a,  b],  and  (3.3.33)  holds.  TJ 

3.3  Exercises 


1.  Prove  Theorem  3.3.2. 

2 .  Prove  Theorem  3 . 3 . 3 . 

3.  Can  |  f\  be  integrable  on  [a,  b]  if  /  is  not? 

4.  Complete  the  proof  of  Theorem  3.3.6.  Hint:  The  partial  proof  given  above  implies 
that  if  mi  and  m-i  are  lower  bounds  for  f  and  g  respectively  on  [a,b],  then 

if  —  m\)ig  —  mf)  is  integrable  on  [a,  b], 

5.  Prove:  If  /  is  integrable  on  [a,  b]  and  |  / (x)\  >  p  >  0  for  a  <  x  <  b,  then  1//  is 
integrable  on  [a,  b] 
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6.  Suppose  that  /  is  integrable  on  [a,  b]  and  define 

(/(*)      if/(*)>0,  (0  if/(x)>0, 

f+(x)  =  and    f~(x)  = 

(0  if/(x)<0,  [  f(x)  if/(x)<0. 

Show  that  /+  and  /"are  integrable  on  [a,  b],  and 

/>6  />6  />6 

/    f(x)dx=  I    f+(x)dx+  /  f~(x)dx. 

Ja  Ja  Ja 

7.  Find  the  weighted  average  u  of  u(x)  over  [a,  ft]  with  respect  to  v,  and  find  a  point  c 
in  [a.  /?]  such  that  m(c)  =  u. 

(a)        =  x,         v(x)  =  x,      [a,b]  =  [0, 1] 


X, 

v(x)  =  X, 

sinx, 

v(x)  =  X2 

x2, 

v(x)  =  ex 

8.  Prove  Theorem  3.3.9. 

9.  Show  that 

pc  rb  rc 

I    f  (x)  dx  =  I    f  (x)  dx  +  I    f  (x)  dx 

Ja  Ja  Jb 

for  all  possible  relative  orderings  of  a,  b,  and  c,  provided  that  /  is  integrable  on  a 
closed  interval  containing  them. 

10.  Prove:  If  /  is  integrable  on  [a,  b]  and  a  =  do  <  a.\  <  ■  ■  ■  <  an  =  b,  then 

rb  rai  pa2  pa„ 

I    f(x)dx  =  I     f(x)dx+  I     f(x)dx  +  h  /  f(x)dx. 

Ja  Jao  J a\  J an—\ 

11.  Suppose  that  /  is  continuous  on  [a,  b]  and  P  =  {xo,  x\, . . . ,  xn}  is  a  partition  of 
[a,b].  Show  that  there  is  a  Riemann  sum  of  /  over  P  that  equals      f  (x)  dx. 

12.  Suppose  that  /'  exists  and  <  M  on  [a,  b].  Show  that  any  Riemann  sum  a 
of  /  over  any  partition  P  of  [a,  b]  satisfies 


,4 

a  —  f    f  (x)  dx 

J  a 


<  M(b-a)\\P\ 


HINT:  See  Exercise  3.3.11. 


13.  Prove:  If  /  is  integrable  and  fix)  >  0  on  [a,  b],  then      f(x)  dx  >  0,  with  strict 
inequality  if  /  is  continuous  and  positive  at  some  point  in  [a,b]. 

14.  Complete  the  proof  of  Theorem  3.3.1 1 . 

15.  State  theorems  analogous  to  Theorems  3.3. 10  and  3.3.11  for  the  function 

G(x)=  f  f(t)dt, 

J  X 

and  show  how  your  theorems  can  be  obtained  from  them. 
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16.  The  symbol  f  f(x)  dx  denotes  an  antiderivative  of  /.  A  plausible  analog  of  The- 
orem 3.3.1  would  state  that  if  /  and  g  have  antiderivatives  on  [a,  b],  then  so  does 
f  +  g,  which  is  true,  and 

f(f+  g)(x)  dx  =  J  f{x)  dx  +  J  g(x)  dx.  (A) 

However,  this  is  not  true  in  the  usual  sense. 

(a)  Why  not? 

(b)  State  a  correct  interpretation  of  (A). 

17.  (See  Exercise  3.3.16.)  Formulate  a  valid  interpretation  of  the  relation 

J (cf)(x)  dx  =  c  j  f{x)  dx    (c  +  0). 

Is  your  interpretation  valid  if  c  =  0? 

18.  (a)   Let  be  integrable  on  [a,  b].  Show  that 

m  =  J2L-rl(b-aY  +  ~  /  fin+1)(t)(b-t)ndt. 

HINT:  Integrate  by  parts  and  use  induction. 
(b)   What  is  the  connection  between  (a)  and  Theorem  2.5.5? 

19.  In  addition  to  the  assumptions  of  Theorem  3.3.16,  suppose  that  /  (a)  =  0,  /  ^  0, 
and  g(x)  >  0  (a  <  x  <  b).  Show  that  there  is  only  one  point  c  in  [a,  b]  with  the 
property  stated  in  Theorem  3.3.16.  Hint:  Use  Exercise  3.3.13. 

20.  Assuming  that  Theorem  3.3.18  is  true  under  the  additional  assumption  that  /  is 
nonnegative  on  [a,  b],  show  that  it  is  true  without  this  assumption. 

21.  Assuming  that  the  conclusion  of  Theorem  3.3.18  is  true  if  (j)  is  nondecreasing,  show 
that  it  is  true  if  <p  is  nonincreasing.  Hint:  Use  Exercise  3.1.6. 

22.  Suppose  g'  is  integrable  and  /  is  continuous  on  [a,  b\.  Show  that  J  fix)  dg(x) 
exists  and  equals  f£  f(x)g'(x)  dx. 

23.  Suppose  /  and  g"  are  bounded  and  fg'  is  integrable  on  [a,  b].  Show  that  f°  f(x)dg{x) 
exists  and  equals      f  (x)g' (x)  dx.  Hint:  Use  Theorem  2.5  A. 

3.4  IMPROPER  INTEGRALS 

So  far  we  have  confined  our  study  of  the  integral  to  bounded  functions  on  finite  closed 
intervals.  This  was  for  good  reasons: 

•  From  Theorem  3.1.2,  an  unbounded  function  cannot  be  integrable  on  a  finite  closed 
interval. 


152    Chapter  3  Integral  Calculus  of  Functions  of  One  Variable 


•  Attempting  to  formulate  Definition  3.1.1  for  a  function  defined  on  an  infinite  or  semi- 
infinite  interval  would  introduce  questions  concerning  convergence  of  the  resulting 
Riemann  sums,  which  would  be  infinite  series. 

In  this  section  we  extend  the  definition  of  integral  to  include  cases  where  /  is  unbounded 
or  the  interval  is  unbounded,  or  both. 

We  say  /  is  locally  integrable  on  an  interval  /  if  /  is  integrable  on  every  finite  closed 
subinterval  of  / .  For  example, 

f(x)  =  sinx 

is  locally  integrable  on  (— oo,  oo); 

g(x)  =    ,  1 

x(x  —  1) 

is  locally  integrable  on  (— oo,  0),  (0,  1),  and  (1,  oo);  and 

h(x)  =  \fx 

is  locally  integrable  on  [0,  oo). 

Definition  3.4.1  If  /  is  locally  integrable  on  [a,  b),  we  define 

f  f(x)dx=  lim   f  f(x)dx  (3.4.1) 

J  a    '  c->b-  J  a 

if  the  limit  exists  (finite).  To  include  the  case  where  b  =  oo,  we  adopt  the  convention  that 
oo—  =  00.  ■ 

The  limit  in  (3.4.1)  always  exists  if  [a,  b)  is  finite  and  /  is  locally  integrable  and  bounded 
on  [a,  b).  In  this  case,  Definitions  3.1.1  and  3.4.1  assign  the  same  value  to  f£  f(x)  dx  no 
matter  how  f(b)  is  defined  (Exercise  3.4.1).  However,  the  limit  may  also  exist  in  cases 
where  b  =  oo  or  b  <  oo  and  /  is  unbounded  as  x  approaches  b  from  the  left.  In  these 
cases,  Definition  3.4.1  assigns  a  value  to  an  integral  that  does  not  exist  in  the  sense  of  Def- 
inition 3.1.1,  and  f  f(x)  dx  is  said  to  be  an  improper  integral  that  converges  to  the  limit 
in  (3.4.1).  We  also  say  in  this  case  that  /  is  integrable  on  [a,  b)  and  that  f  f  (x)  dx  exists. 
If  the  limit  in  (3.4.1)  does  not  exist  (finite),  we  say  that  the  improper  integral  f  (x)  dx 
diverges,  and  /  is  nonintegrable  on  [a,b).  In  particular,  if  linic^-  fa  fix)  dx  =  ±oo, 
we  say  that      f(x)dx  diverges  to  ±oo,  and  we  write 

rb  rb 

I    f(x)dx  =  oo    or     /    f(x)dx  =  —  oo, 

J  a  J  a 


whichever  the  case  may  be. 

Similar  comments  apply  to  the  next  two  definitions. 
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Definition  3.4.2  If  /  is  locally  integrable  on  (a,  b],  we  define 

pb  pb 

I    f(x)dx=   lim    /  f(x)dx 

J  a  C^a+Jc 

provided  that  the  limit  exists  (finite).  To  include  the  case  where  a  =  —  oo,  we  adopt  the 
convention  that  — oo+  =  — oo. 

Definition  3.4.3  If  /  is  locally  integrable  on  (a,  b),  we  define 

pb  pa  pb 

I    f(x)dx=  I    f(x)dx+  I  f(x)dx, 

Ja  Ja  Ja 

where  a  <  a  <  b,  provided  that  both  improper  integrals  on  the  right  exist  (finite).  ■ 

The  existence  and  value  of  f%  f(x)  dx  according  to  Definition  3.4.3  do  not  depend  on 
the  particular  choice  of  a  in  (a,  b)  (Exercise  3.4.2). 

When  we  wish  to  distinguish  between  improper  integrals  and  integrals  in  the  sense  of 
Definition  3. 1 . 1 ,  we  will  call  the  latter  proper  integrals. 

In  stating  and  proving  theorems  on  improper  integrals,  we  will  consider  integrals  of 
the  kind  introduced  in  Definition  3.4.1.  Similar  results  apply  to  the  integrals  of  Defini- 
tions 3.4.2  and  3.4.3.  We  leave  it  to  you  to  formulate  and  use  them  in  the  examples  and 
exercises  as  the  need  arises. 


Example  3.4.1  The  function 


/ (x)  =  2x  sin  cos  — 

X  X 

is  locally  integrable  and  the  derivative  of 

F(x)  =  x2  sin  — 

x 

on  [—2/n,  0).  Hence, 

1  4 


7  1 

/ (x)  dx  =  x  sin  — 

2/jt  X 


=  c2  sin  — I  

2/7Z  c  * 


and 


f°                           (         1       4  \  4 
/       f(x)dx=  lim    c2sin-H   =  — , 

J-2/yr  c^0-\  C  TT2) 


according  to  Definition  3.4.1.  However,  this  is  not  an  improper  integral,  even  though  / (0) 
is  not  defined  and  cannot  be  defined  so  as  to  make  /  continuous  at  0.  If  we  define  / (0) 
arbitrarily  (say  /(0)  =  10),  then  /  is  bounded  on  the  closed  interval  [—2/n,  0]  and  con- 
tinuous except  at  0.  Therefore,  fH.2/K  f{x)dx  exists  and  equals  4/jt2  as  a  proper  integral 
(Exercise  3.4.1),  in  the  sense  of  Definition  3.1.1. 
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Example  3.4.2  The  function 

f(x)  =  (l-x)-P 
is  locally  integrable  on  [0,  1)  and,  if  p  ^  1  and  0  <  c  <  1, 

(1  -c)-p+l  -  1 


f{\-x)-pdx^  0  X)  "+1 
Jo 


p-1 


Hence, 


For  p  =  1, 


Hence, 


p-l 


P)-\  P<h 


lim   [  (l-x)-pdx  =  r1 
im   /   (1  —  x)~l  dx  =  —  lim  log(l  —  c)  =  00. 

"►Wo  e^!- 

f  (1  -x)-p  dx  =  {' 
Jo  (( 


](l-p)-\  P<1, 

1 OO,  /?  >  1. 


Example  3.4.3  The  function 

f(x)  =  x-p 

is  locally  integrable  on  [1,  00)  and,  if  p  ^  1  and  c  >  1, 


-/>+! 


-p+1 


<  _  c-P  +  i  _  1 

1  ~  rpTl 


Hence, 


For  p  =  1, 


Hence, 


lim  r,-^,={^-1)-1' 

e^oo  (OO,  /?  <  1. 


lim  /    x  1 


dx  =  lim  logc  =  00. 


/OO 
dx  = 

Example  3.4.4  If  1  <  c  <  00,  then 


(p-l)-1,  P>1, 
00,  /?  <  1. 


Hence, 


so 


fc  I       I  fc  1  1 

/  —\og—dx  =  —\  —\ogxdx  =  —-(logxY 
J 1  x      x  J\  x  2 

r  1    1  , 

lim  /    —  log  —  dx  =  — 00, 

P    1  ,         1  W 

/     —  log  —  dx  =  —00. 
Ji     x  x 


=  -2dog 
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Example  3.4.5  The  function  / (x)  =  cos  x  is  locally  integrable  on  [0,  oo)  and 

lim  /    cos  x  dx  =  lim  sin  c 

c— >oo  Jq  c— >oo 

does  not  exist;  thus,  /0°°  cos  x  dx  diverges,  but  not  to  ±oo. 

Example  3.4.6  The  function  f(x)  =  log*  is  locally  integrable  on  (0,  1],  but  un- 
bounded as  x      0+.  Since 


lim   /    logx  dx  =  lim  (xlogx—x) 

c-*0+  Jc  c^0  + 


-1  —  lim  (clog c  —  c)  =  —I, 

c~»0+ 


Definition  3.4.2  yields 

log  x  dx  =  —  1 . 


/' 

Jo 


Example  3.4.7  In  connection  with  Definition  3.4.3,  it  is  important  to  recognize  that 
the  improper  integrals  f"  f  (x)  dx  and  f  f  (x)  dx  must  converge  separately  for  f(x)dx 
to  converge.  For  example,  the  existence  of  the  symmetric  limit 

fR 

lim   /  f(x)dx, 
R^oo  J„R 

which  is  called  the  principal  value  of  f_    f  (x)  dx,  does  not  imply  that  /_    / (x)  dx 
converges;  thus, 


lim   /    x  dx  =  lim  0  =  0, 


'  J -J 

but  /0°°  x  dx  and        x  dx  diverge  and  therefore  so  does        x  dx . 


Theorem  3.4.4  Suppose  that  f\,  f2,  . . . ,  fn  are  locally  integrable  on  [a,  b)  and  that 
fa  fi(x)  dx,  f  f2(x)  dx,  fn(x)  dx  converge.  Let  c\,  c2,        c„  be  constants. 

Then  J^{c\  f  +  c2f\  H  +  cnfn)(x)  dx  converges  and 


pb  pb 

I  (cifi+c2f2-\  \-c„f„)(x)  dx  =  a  /    fi(x)dx+c2  f2(x)dx 

Ja  J  a  Ja 

H  /  fn(x)dx. 

J  a 

Proof   If  a  <  c  <  b,  then 

/   (ci/i  +  C2/2  H  h  c„f„)(x)  dx  =  ci  /    fi(x)  dx  +  c2  /    f2(x)  dx 

J  a  J  a  J  a 

H  /  fn(x)dx, 

Ja 

by  Theorem  3.3.3.  Letting  c  — >  b—  yields  the  stated  result.  H 
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Improper  Integrals  of  Nonnegative  Functions 

The  theory  of  improper  integrals  of  nonnegative  functions  is  particularly  simple. 

Theorem  3.4.5  Iff  is  nonnegative  and  locally  integrable  on  [a,  b),  then  f(x)dx 
converges  if  the  function 


r 

J  a 


F(x)=  /  f(t)dt 


is  bounded  on  [a,  b),  and  f  f  (x)  dx  =  oo  if  it  is  not.  These  are  the  only  possibilities,  and 


b 

f(t)dt  =    sup  F(x) 


a<x<b 

in  either  case. 

Proof  Since  F  is  nondecreasing  on  [a,  b),  Theorem  2.1.9(a)  implies  the  conclusion. 
TJ 

We  often  write 


L 


b 

f  (x)  dx  <  oo 


to  indicate  that  an  improper  integral  of  a  nonnegative  function  converges.  Theorem  3.4.5 
justifies  this  convention,  since  it  asserts  that  a  divergent  integral  of  this  kind  can  only  di- 
verge to  oo.  Similarly,  if  /  is  nonpositive  and  f  f  (x)  dx  converges,  we  write 


L 


b 

f  (x)  dx  >  — oo 


because  a  divergent  integral  of  this  kind  can  only  diverge  to  — oo.  (To  see  this,  apply 
Theorem  3.4.5  to  —  /.)  These  conventions  do  not  apply  to  improper  integrals  of  functions 
that  assume  both  positive  and  negative  values  in  (a,b),  since  they  may  diverge  without 
diverging  to  ±oo. 

Theorem  3.4.6  (Comparison  Test)  Iff  and  g  are  locally  integrable  on  [a,  b) 
and 

0  <  /(*)  <  g(x),     a<x  <b,  (3.4.2) 

then 

r>b  fb 

(a)  /    f(x)  dx  <  oo    if     I    g(x)  dx  <  oo 

J  a  J  a 

and 

pb  pb 

(b)  /    g(x)dx  =  oo    if     I    f(x)dx  =  oo. 

J  a  J  a 
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Proof    (a)  Assumption  (3.4.2)  implies  that 

f  f(f)dt<  f  g{t)dt,  a<x<b 

J  a  J  a 


(Theorem  3.3.4),  so 


sup     /    f(t)dt  <    sup    /  g(t)dt. 

i<x<b  Ja  a<x<b  Ja 


If     g(x)  dx  <  oo,  the  right  side  of  this  inequality  is  finite  by  Theorem  3.4.5,  so  the  left 
side  is  also.  This  implies  that  f  f  (x)  dx  <  oo,  again  by  Theorem  3.4.5. 

(b)  The  proof  is  by  contradiction.  If     g(x)  dx  <  oo,  then  (a)  implies  that  f  f  (x)  dx  < 

oo,  contradicting  the  assumption  that  f  f  (x)  dx  =  oo.  TJ 

The  comparison  test  is  particularly  useful  if  the  integrand  of  the  improper  integral  is 
complicated  but  can  be  compared  with  a  function  that  is  easy  to  integrate. 


Example  3.4.8  The  improper  integral 

2  +  sinTTX 


Jo 


(l-x)P 
converges  if  p  <  1,  since 


dx 


2  +  sinxx  3 

0<   <  ,    0  <  x  <  1, 

(1  -x)p    ~  (1  -x)p 


and,  from  Example  3.4.2, 

*x  3dx 


/ 

Jo 


!o  (l-x)P 
However,  /  diverges  if  p  >  1,  since 


<  oo,     p  <  1. 


1  2  +  sin  jtx 

0<   <  ,    0  <  x  <  1, 

(1  -x)P  ~    (1  -X)P 

and 

r1  dx 

If  /  is  any  function  (not  necessarily  nonnegative)  locally  integrable  on  [a,  b),  then 
I    f(x)dx=  I     f(x)dx+  J  f{x)dx 

J  a  J  a  J a\ 

if  ct\  and  c  are  in  [a,b).  Since  f^1  f(x)dx  is  a  proper  integral,  on  letting  c  — ►  b—  we 

conclude  that  if  either  of  the  improper  integrals  fa  f  (x)  dx  and  f  f(x)dx  converges 
then  so  does  the  other,  and  in  this  case 

nb  [>a\  nb 

I    f(x)dx=  I     f(x)dx+  I  f(x)dx. 

Ja  Ja  J  a\ 
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This  means  that  any  theorem  implying  convergence  or  divergence  of  an  improper  integral 
f(x)dx  in  the  sense  of  Definition  3.4.1  remains  valid  if  its  hypotheses  are  satisfied 
on  a  subinterval  [ci\ ,  b)  of  [a,  b)  rather  than  on  all  of  [a,  b).  For  example,  Theorem  3.4.6 
remains  valid  if  (3.4.2)  is  replaced  by 

0  <  fix)  <  g(x),  ai<x<b, 

where  a\  is  any  point  in  [a,  b). 

From  this,  you  can  see  that  if  f(x)  >  0  on  some  subinterval  [a\,  b)  of  [a,  b),  but  not 
necessarily  for  all  x  in  [a,  b),  we  can  still  use  the  convention  introduced  earlier  for  positive 
functions;  that  is,  we  can  write  f  f  (x)  dx  <  oo  if  the  improper  integral  converges  or 
f  (x)  dx  =  oo  if  it  diverges. 

Example  3.4.9  If  p  >  0,  then 

x~p  <  (x-l)P(2  +  smx)  < 
2    -       (x-l/3)2P       ~  X 

for  x  sufficiently  large.  Therefore,  Theorem  3.4.6  and  Example  3.4.3  imply  that 

(x  -  l)p(2  +  sinx) 


h         (x  -  l/3)2^ 
converges  if  p  >  1  or  diverges  if  p  <  1 . 


dx 


Theorem  3.4.7  Suppose  that  f  and  g  are  locally  integrable  on  [a,  b),  g(x)  >  0  and 
fix)  —  0  on  some  subinterval  [ai,  b)  of[a,b),  and 

fix) 

lim  — -  =  M.  (3.4.3) 

x^b-  gix) 

(a)  I/O  <  M  <  oo,  then  f£  fix)  dx  and     gix)  dx  converge  or  diverge  together. 

(b)  If  M  =  oo  and  f  g(x)  dx  =  oo,  then  f  f  (x)  dx  =  oo. 

(c)  If  M  =  0  and     g(x)  dx  <  oo,  then      fix)  dx  <  oo. 

Proof    (a)  From  (3.4.3),  there  is  a  point  a-i  in  [a\,  b)  such  that 

M      fix)  3M 

0  <  —  <  — - —  <  ,    czt  <  x  <  b, 

2       g(x)  2 

and  therefore 

M  3M 

y?W  <  /(*)  <  —Six),     a2<x  <b.  (3.4.4) 
Theorem  3.4.6  and  the  first  inequality  in  (3.4.4)  imply  that 

pb  pb 

I    g(x)dx  <  oo    if      /    f(x)dx  <  oo. 
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Theorem  3.4.6  and  the  second  inequality  in  (3.4.4)  imply  that 

pb  pb 

I    f(x)dx  <  oo    if      /    g(x)dx  <  oo. 

Therefore,  f    f(x)dx  and  f   g(x)  dx  converge  or  diverge  together,  and  in  the  latter  case 
they  must  diverge  to  oo,  since  their  integrands  are  nonnegative  (Theorem  3.4.5). 

(b)  If  M  =  oo,  there  is  a  point  a2  in  [ci\ ,  b)  such  that 

/(*)  >  g(x),  a2<x<b, 

so  Theorem  3.4.6(b)  implies  that  f  fix)  dx  =  oo. 

(c)  If  M  =  0,  there  is  a  point  a2  in  [ai,  b)  such  that 

fix)  <  g(x),  a2<x<b, 

so  Theorem  3.4.6(a)  implies  that  f  fix)  dx  <  oo.  TJ 

The  hypotheses  of  Theorem  3.4.7(b)  and  (c)  do  not  imply  that     fix)  dx  and     g(x)  dx 
necessarily  converge  or  diverge  together.  For  example,  if  b  =  oo,  then  / (x)  =  l/x 
and  g(x)  =  l/x2  satisfy  the  hypotheses  of  Theorem  3.4.7(b),  while  fix)  =  l/x2  and 
gix)  =  l/x  satisfy  the  hypotheses  of  Theorem  3.4.7(c).  However,  f^  l/xdx  =  oo, 
while       l/x2  dx  <  oo. 

Example  3.4.10  Let  f(x)  =  (1  +  x)~p  and  g{x)  =  x~p.  Since 

fix) 
lim  — -  =  1 

x^°°  gix) 

and  /j00  x~p  dx  converges  if  p  >  1  or  diverges  if  p  <  1  (Example  3.4.3),  Theorem  3.4.7 
implies  that  the  same  is  true  of 

poo 

J  (l+x)-pdx. 

Example  3.4.11  The  function 

fix)  =  x-"(l  +  x)-i 
is  locally  integrable  on  (0,  oo).  To  see  whether 

/>oo 

/  =  /     x~p(l  +  x)-q  dx 
Jo 

converges  according  to  Definition  3.4.3,  we  consider  the  improper  integrals 

pi  poo 

h  =  J    x~p(l  +  x)~q  dx    and    I2  =  J     x~p(l  +  x)~q  dx 
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separately.  (The  choice  of  1  as  the  upper  limit  of  I\  and  the  lower  limit  of  I2  is  completely 
arbitrary;  any  other  positive  number  would  do  just  as  well.)  Since 

lim  =   lim  (1  +  xTq  =  1 

+  X~P        x-^Q  + 

and 


/ 

Jo 


00,  p  >  1, 


Theorem  3.4.7  implies  that  I\  converges  if  and  only  if  p  <  1.  Since 
lim  _^*L  =  lim  (j  +  x)-ixi  =  1 

x— >oo  x   P  Q  x—>oo 


and 


r  x-p-qdx  =  \{p  +  q-l)-\  p+q>l, 
1  (oo,  p+?<l, 


Theorem  3.4.7  implies  that  72  converges  if  and  only  if  p  +  q  >  1.  Combining  these 
results,  we  conclude  that  I  converges  according  to  Definition  3.4.3  if  and  only  if  p  <  1 
and  p  +  q  >  1. 


Absolute  Integrability 

Definition  3.4.8  We  say  that  /  is  absolutely  integrable  on  [a,  b)  if  /  is  locally  inte- 
grable  on  [a,  b)  and  f  \f(x)\  dx  <  00.  In  this  case  we  also  say  that  f  f  (x)  dx  converges 
absolutely  or  is  absolutely  convergent. 


Example  3.4.12  If  /  is  nonnegative  and  integrable  on  [a,b),  then  /  is  absolutely 
integrable  on  [a,  b),  since  \f\  =  f. 


Example  3.4.13  Since 


xP 


xP 


and  /j00  x  p  dx  <  00  if  p  >  1  (Example  3.4.3),  Theorem  3.4.6  implies  that 


r 


sinx| 
xP 


dx  <  00,    p  >  1; 


that  is,  the  function 


/(*)  = 


x  p 


is  absolutely  integrable  on  [1,  00)  if  p  >  1.  It  is  not  absolutely  integrable  on  [1,  00)  if 
p  <  I.  To  see  this,  we  first  consider  the  case  where  p  =  1 .  Let  k  be  an  integer  greater 
than  3.  Then 
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-kn  ,*«  |sinjc| 

■   flX 


/■K3r  I  sinxl  ,  /" 
/  — —  dx  > 

Jl  x  Jti 


X 


>     >  —  /  I  sinxl  dx. 


But 

»(y  +  l)jr 


so  (3.4.5)  implies  that 


—    rU+D*  |sinJC| 

=       /  —T~ dx  (3.4.5) 

!  /■(„ 
^  (7  +  l)n  JJJT 

/  |  sinx|  dx  =  I    smxdx  =  2 

J  jn  JO 

[k*  |  sinxl  J  2^1 
;   dx  >  —  >   . 


However, 

1         fJ+2  dx 

-r>  /       — .  7=1,2,..., 


7  + 

so  (3.4.6)  implies  that 


c kn  i        „i  ry  k  1 


I  sin x |  2  ^  tfcc 

Ji       x  71  mi  x 

2  fk+1  dx      2  k+1 
=  -  /       —  =  -log^— 

7T  J2  X  TV  2 


Since  lim^oo  log[(/c  +  l)/2]  =  oo,  Theorem  3.4.5  implies  that 

sin  x  I 


dx  =  oo. 


x 


Now  Theorem  3.4.6(b)  implies  that 


r 


sin  x 


xP 


(3.4.6) 


dx  =  oo,     p  <  1.  (3.4.7) 


Theorem  3.4.9      /  is  locally  integrable  on  [a,b)  and  f  \f(x)\dx  <  oo,  then 
f  (x)  dx  converges;  that  is,  an  absolutely  convergent  integral  is  convergent. 

Proof  If 

g{x)  =  \f(x)\-f{x), 
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then 

0  <  g(x)  <  2\f(x)\ 

and  f  g(x)  dx  <  oo,  because  of  Theorem  3.4.6  and  the  absolute  integrability  of  /.  Since 

f  =  \f\-g, 

Theorem  3.4.4  implies  that  f%  f  (x)  dx  converges.  H 
Conditional  Convergence 

We  say  that  /  is  nonoscillatory  at  b—  (=  oo  if  b  =  oo)  if  /  is  defined  on  [a,  b)  and 
does  not  change  sign  on  some  subinterval  [a\ ,  b)  of  [a,  b).  If  /  changes  sign  on  every 
such  subinterval,  /  is  oscillatory  at  b—.  For  a  function  that  is  locally  integrable  on  [a,  b) 
and  nonoscillatory  at  b—,  convergence  and  absolute  convergence  of  f  (x)  dx  amount 
to  the  same  thing  (Exercise  3.4.16),  so  absolute  convergence  is  not  an  interesting  concept 
in  connection  with  such  functions.  However,  an  oscillatory  function  may  be  integrable, 
but  not  absolutely  integrable,  on  [a,  b),  as  the  next  example  shows.  We  then  say  that  /  is 
conditionally  integrable  on  [a,  b),  and  that      f{x)  dx  converges  conditionally. 

Example  3.4.14  We  saw  in  Example  3.4.13  that  the  integral 

f°°  sinx  , 

[(P)  =  /  — IT  dx 
Ji  xP 

is  not  absolutely  convergent  if  0  <  p  <  I.  We  will  show  that  it  converges  conditionally  for 
these  values  of  p. 

Integration  by  parts  yields 

fc  sinx  ,       —cose  fc  cosx  , 

I   dx  =  hcosl-z?/  — —  dx.  (3.4.8) 

Ji      xP  CP  Ji  XP+1 

Since 

1 


cp+i 


< 


cp  +  l 


and  x~p~l  dx  <  oo  if  p  >  0,  Theorem  3.4.6  implies  that  x~p~l  cos*  is  absolutely 
integrable  [1 ,  oo)  if  p  >  0.  Therefore,  Theorem  3.4.9  implies  that  x~p~l  cos  x  is  integrable 
[1,  oo)  if  p  >  0.  Letting  c  —>  oo  in  (3.4.8),  we  find  that  I(p)  converges,  and 

/°°  cosx 
p+1  dx    if    p  >  0. 

This  and  (3.4.7)  imply  that  I(p)  converges  conditionally  if  0  <  p  <  1.  ■ 


The  method  used  in  Example  3.4. 14  is  a  special  case  of  the  following  test  for  convergence 
of  improper  integrals. 
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Theorem  3.4.10  (Dirichlet's  Test)  Suppose  that  f  is  continuous  and  its  an- 
tiderivative  F(x)  =  f*  f  (t)  dt  is  bounded  on  [a,  b).  Let  g'  be  absolutely  integrable  on 
[a,  b),  and  suppose  that 


lim  g(x)  =  0.  (3.4.9) 


Then      f(x)g(x)  dx  converges. 


Proof  The  continuous  function  fg  is  locally  integrable  on  [a,  b).  Integration  by  parts 
yields 


(  f(x)g(x)dx  =  F{c)g(c)  -  (  F(x)g'(x)dx,    a<c  <b.  (3.4. 

J  a  J  a 


10) 


Theorem  3.4.6  implies  that  the  integral  on  the  right  converges  absolutely  as  c  — ►  b—,  since 
fa  \s'(x)  \  dx  <  oo  by  assumption,  and 

\F(x)g'(x)\  <  M\g'(x)\, 

where  M  is  an  upper  bound  for  |,F|  on  [a,  b).  Moreover,  (3.4.9)  and  the  boundedness  of  F 
imply  that  \m\c^b-  F(c)g(c)  =  0.  Letting  c  — >  b—  in  (3.4.10)  yields 


pb  fb 

/    f(x)g(x)dx  =  -  F(x)g'(x)dx, 

Ja  Ja 


where  the  integral  on  the  right  converges  absolutely.  TJ 
Dirichlet's  test  is  useful  only  if  /  is  oscillatory  at  b—,  since  it  can  be  shown  that  if  /  is 
nonoscillatory  at  b—  and  F  is  bounded  on  [a,  b),  then  f  \f(x)g(x)\  dx  <  oo  if  only  g  is 
locally  integrable  and  bounded  on  [a,  b)  (Exercise  3.4.14). 


Example  3.4.15  Dirichlet's  test  can  also  be  used  to  show  that  certain  integrals  di- 
verge. For  example, 

xq  sinx  dx 

diverges  if  q  >  0,  but  none  of  the  other  tests  that  we  have  studied  so  far  implies  this.  It 
is  not  enough  to  argue  that  the  integrand  does  not  approach  zero  as  x  — >  oo  (a  common 
mistake),  since  this  does  not  imply  divergence  (Exercise  4.4.31).  To  see  that  the  integral 
diverges,  we  observe  that  if  it  converged  for  some  q  >  0,  then  F(x)  =  f*  xq  sinx  dx 
would  be  bounded  on  [1,  oo),  and  we  could  let 

f{x)  =  xq  sinx    and    g(x)  =  x~q 

in  Theorem  3.4.10  and  conclude  that 


sin  x  dx 

Ji 

also  converges.  This  is  false. 


r 
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The  method  used  in  Example  3.4.15  is  a  special  case  of  the  following  test  for  divergence 
of  improper  integrals. 

Theorem  3.4.11  Suppose  that  u  is  continuous  on  [a,  b)  and  f  u(x)  dx  diverges.  Let 
v  be  positive  and  differentiable  on  [a,  b),  and  suppose  that  \imx^.h-  v(x)  =  oo  and  v' / v2 
is  absolutely  integrable  on  [a,  b).  Then  f£  u(x)v(x)  dx  diverges. 

Proof   The  proof  is  by  contradiction.  Let  /  =  uv  and  g  =  l/v,  and  suppose  that 
u(x)v(x)  dx  converges.  Then  /  has  the  bounded  antiderivative  F(x)  =  f*  u(t)v(t)  dt 
on  [a,  b),  limx^oo  g{x)  =  0  and  g'  =  —v'j v2  is  absolutely  integrable  on  [a,  b).  Therefore, 
Theorem  3.4.10  implies  that      u(x)  dx  converges,  a  contradiction.  TJ 

If  Dirichlet's  test  shows  that  f(x)g(x)  dx  converges,  there  remains  the  question  of 
whether  it  converges  absolutely  or  conditionally.  The  next  theorem  sometimes  answers  this 
question.  Its  proof  can  be  modeled  after  the  method  of  Example  3.4.13  (Exercise  3.4.17). 
The  idea  of  an  infinite  sequence,  which  we  will  discuss  in  Section  4.1,  enters  into  the 
statement  of  this  theorem.  We  assume  that  you  recall  the  concept  sufficiently  well  from 
calculus  to  understand  the  meaning  of  the  theorem. 

Theorem  3.4. 12  Suppose  that  g  is  monotonic  on  [a,  b)  and  f£  g(x)  dx  =  oo.  Let  f 
be  locally  integrable  on  [a,  b)  and 

xJ+i 

\f(x)\dx>p,  j>0, 

j 

for  some  positive  p,  where  {x  j}  is  an  increasing  infinite  sequence  of  points  in  [a,  b)  such 
that  lim/^oo  Xi  =  b  and  Xi+i  —  Xj  <  M,  j  >  0,  for  some  M.  Then 

b 

\f(x)g{x)\  dx  =  oo. 


L 


Change  of  Variable  in  an  Improper  Integral 

The  next  theorem  enables  us  to  investigate  an  improper  integral  by  transforming  it  into 
another  whose  convergence  or  divergence  is  known.  It  follows  from  Theorem  3.3.18  and 
Definitions  3.4.1,  3.4.2,  and  3.4.3.  We  omit  the  proof. 

Theorem  3.4.13  Suppose  that  <p  is  monotonic  and  <p'  is  locally  integrable  on  either 
of  the  half-open  intervals  I  =  [c,  d)  or  (c,  d],  and  let  x  =  (p(t)  map  I  onto  either  of  the 
half-open  intervals  J  =  [a.b)  or  J  =  (a,  b].  Let  f  be  locally  integrable  on  J.  Then  the 
improper  integrals 

pb  t>d 

I    f(x)dx    and     /    /  (0(0)  I0'(OI  dt 

J  a  Jc 
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diverge  or  converge  together,  in  the  latter  case  to  the  same  value.  The  same  conclusion 
holds  ifcp  and  cp'  have  the  stated  properties  only  on  the  open  interval  {a,  b),  the  transfor- 
mation x  =  (p{t)  maps  (c,  d)  onto  (a,  b),  and  f  is  locally  integrable  on  {a,  b). 

Example  3.4.16  To  apply  Theorem  3.4.13  to 

oo 

sinx2  dx, 

we  use  the  change  of  variable  x  =  (p(t)  =  ~Jt,  which  takes  [c,  d)  =  [0,  oo)  into  [a,  b)  = 
[0,  oo),  with  (p'(t)  =  1  /(2VF).  Theorem  3.4.13  implies  that 

f°°       -  1  f°°  sin*  , 

/     sinx  dx  =  -  I  — —  dt, 
Jo  2j0  Vf 

Since  the  integral  on  the  right  converges  (Example  3.4.14),  so  does  the  one  on  the  left. 
Example  3.4.17  The  integral 

/oo 
x~p  dx 

converges  if  and  only  if  p  >  1  (Example  3.4.3).  Defining  </>(f)  =  \jt  and  applying 
Theorem  3.4.13  yields 

/>00  p  1  p  1 

/     x~pdx=      tp\-t~2\dt=  tp~2dt, 
Ji  Jo  Jo 

which  implies  that  L  tq  dt  converges  if  and  only  if  q  >  —1. 

3.4  Exercises 


L 


1.  (a)  Let  /  be  locally  integrable  and  bounded  on  [a,b),  and  let  f(b)  be  defined 
arbitrarily.  Show  that  /  is  properly  integrable  on  [a,  b],  that  f  f  (x)  dx  does 
not  depend  on  / (b),  and  that 

/    f(x)dx=  lim   /  f(x)dx. 

J  a  c~*b-  J  a 

(b)    State  a  result  analogous  to  (a)  which  ends  with  the  conclusion  that 

pb  nb 

I    f(x)dx=   lim    /  f(x)dx. 

J  a  C^a+Jc 


2.     Show  that  neither  the  existence  nor  the  value  of  the  improper  integral  of  Defini- 
tion 3.4.3  depends  on  the  choice  of  the  intermediate  point  a. 
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3. 
4. 


Prove:  If  f  f  (x)  dx  exists  according  to  Definition  3.4.1  or  3.4.2,  then  f  (x)  dx 
also  exists  according  to  Definition  3.4.3. 

Find  all  values  of  p  for  which  the  following  integrals  exist  (i)  as  proper  integrals 
(perhaps  after  defining  /  at  the  endpoints  of  the  interval)  or  (ii)  as  improper  inte- 
grals, (iii)  Evaluate  the  integrals  for  the  values  of  p  for  which  they  converge. 


1  1 ' 

pxp~l  sin  xp~2  cos  —  I  dx 

x  x 


■>  r  ( 

b)         n  (px"-lC-^-x  +  xp~2  sin^\  dx 

pOO  pi  p 

)  /     e~px  dx     (d)  /    x~P  dx     (e)  / 
Jo  Jo  Jo 

aluate 

/>00  />C 

)  \     e~xxndx    (n  =  0,1,...)       (b)  / 
Jo  Jo 

x  dx 


X2  +  1 


cos  x     sin  x 


dx 


(d)  f 

Jo 

(f»f( 

Jjt/2  \ 


x  p  dx. 

e~x  sinx  dx 
x  dx 


sinx  cosx 
 + 


dx 


Prove:  If  f  f  (x)  dx  exists  as  a  proper  or  improper  integral,  then 


lim 

x— *b- 


i: 


f(t)dt  =  0. 


Prove:  If  /  is  locally  integrable  on  [a,  b),  then  J  f  (x)  dx  exists  if  and  only  if  for 

>)  such  tl 

f{t)dt 


each  €  >  0  there  is  a  number  r  in  (a,  b)  such  that 

-x2 


r 


<  e 


whenever  r  <  x\,  x-i  <  b.  Hint:  See  Exercise  2.1.38. 
8.    Determine  whether  the  integral  converges  or  diverges. 


log  x  +  sin  x 


,  x  r00  i  +  cos2..  , 

(c)  /     —        =  dx 
Jo  ^ 


dx 


(b)  / 

J—t 


3/2 


(x2  +  3) 

(X4  +  l)3/2 

4  +  cos  x 


i2  x  dx 


VTT. 


/>oo 

(e)  /    (x21  +  sinx)e~x  dx 
Jo 


(d)  / 

(f)  /     x~p{2  +  smx)dx 
Jo 
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9.    Find  all  values  of  p  for  which  the  integral  converges 

™  7r/2    •  /•tt/2 


/-^sin*  /"^cos*  p  _x 

(a)  /   dx  (b)  /   dx  (c)  /     xpe  dx 

Jo       xp  Jo       xp  Jo 

r'2  sinx  r00    </jc  m  r1 

^  ;io      (tan*)*  *       le;/i    *(log*)*  Ui0  *(|log*|)* 

/  \  [n  xdx 
{g)  Jo  (sinx)P 

10.     Let  Ln(x)  be  the  iterated  logarithm  defined  in  Exercise  2.4.42.  Show  that 


./a 


Lo(*)Li(x)---Ljt(x)[Ljt+i(x)]* 


converges  if  and  only  if  /?  >  1.  Here  a  is  any  number  such  that  L^+iix)  >  0  for 
x  >  a. 

11.     Find  conditions  on  /?  and  q  such  that  the  integral  converges. 

(a)  f  {™**xly  dx  (b)  f\l-xni+xrdx 

[<*>    xPdx  [<*>  [log(l+*)F(log*)« 

(c)io  OT^  (d)it  - 


xp+q 

(log(l  +  *)  -  log*)9  ,  ...  f°°(x-smx)i 


dx 


(e)  rw  +  w^  (f)r 

xp  Jo 


dx 


xP 

12.     Let  /  and  g  be  polynomials  and  suppose  that  g  has  no  real  zeros.  Find  necessary 
and  sufficient  conditions  for  convergence  of 


f 

J—n 


 dx. 

g(x) 


13.  Prove:  If  /  and  g  are  locally  integrable  on  [a,  b)  and  the  improper  integrals  f  f2(x)dx 
and  f  g2(x)  dx  converge,  then  f  f(x)g(x)  dx  converges  absolutely.  Hint:  (/  ± 
g)2  >  0. 

14.  Suppose  that  /  is  locally  integrable  and  F(x)  =  f  f{t)  dt  is  bounded  on  [a,  b), 
and  let  /  be  nonoscillatory  at  b— .  Let  g  be  locally  integrable  and  bounded  on  [a,  b). 
Show  that 


L 


b 

\f(x)g(x)\  dx  <  oo. 


15.     Suppose  that  g  is  positive  and  nonincreasing  on  [a,b)  and  f  f(x)dx  exists  as 

a  proper  or  absolutely  convergent  improper  integral.  Show  that  f  f(x)g(x)dx 
exists  and 
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lim  J-  f  f(t)g(f)dt  =0. 


Hint:  £Aje  Exercise  3.4.6. 

16.  Showthatif  /  is  locally  integrableon  [a,  b)  and  nonoscillatory  at  b— ,  then  f  f(x)dx 
exists  if  and  only  if  f£  \f(x)\  dx  <  oo. 

17.  (a)   Prove  Theorem  3.4.12.  Hint:  See  Example  3.4.13. 

(b)    Show  that  g  satisfies  the  assumptions  of  Theorem  3.4.10  if  g'  is  locally  inte- 
grable,  g  is  monotonic  on  [a,  b),  and  \imx^j,-  g(x)  —  0. 

18.  Find  all  values  of  p  for  which  the  integral  converges  (i)  absolutely;  (ii)  condition- 
ally. 


toT  —  dx  (b)f     Sln*     dx  (c)f 

,  ,N   r°°  sinl/x  ,  .   f°°  sin2x  sin2x  f°°  sinx 

(d)/    —  ^         (e)io  xP        dx  WJ_ 


dx 


■  dx 


(1  +x2)P 

19.  Suppose  that  g"  is  absolutely  integrable  on  [0,  oo),  lirn^oo  g'{x)  =  0,  and  lim*-^  g{x)  = 
L  (finite  or  infinite).  Show  that  fQ  g(x)  sin  x  dx  converges  if  and  only  if  L  =  0. 
Hint:  Integrate  by  parts. 

20.  Let  h  be  continuous  on  [0,  oo).  Prove: 

(a)  If       e~s°xh(x)  dx  converges  absolutely,  then       e~sxh(x)  dx  converges 
absolutely  if  s  >  So. 

(b)  If      e~s°xh(x)  dx  converges,  then       e~sxh(x)  dx  converges  if  s  >  So- 

21.  Suppose  that  /  is  locally  integrable  on  [0,  oo),  limx^oo  f(x)  =  A,  and  a  >  —I. 
Find  Hindoo  x-"-1  fQ  f  (t)ta  dt,  and  prove  your  answer. 

22.  Suppose  that  /  is  continuous  and  F(x)  =  J  f  (t)  dt  is  bounded  on  [a,  b).  Suppose 
also  that  g  >  0,  g'  is  nonnegative  and  locally  integrable  on  [a,  b),  and  limx^6_  g(x)  = 
oo.  Show  that 

1  fx 

lh?  mrp  /  f^f)s(f)dt  =  o,  P  >  i. 

x^b-  [g{x)Y  ja 
HINT:  Integrate  by  parts. 

23.  In  addition  to  the  assumptions  of  Exercise  3.4.22,  assume  that     f  (t)  dt  converges. 
Show  that 

lim  ~~ TT  f  f(?)g(?)dt  =0. 

x^b-  g(x)  Ja 

Hint:  Let  F(x)  =  f  f(t)  dt,  integrate  by  parts,  and  use  Exercise  3.4.6. 
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24.     Suppose  that  /  is  continuous,  g'{x)  <  0,  and  g(x)  >  0  on  [a,  b).  Show  that  if  g'  is 
integrable  on  [a,  b)  and      f  (x)  dx  exists,  then      f  (x)g(x)  dx  exists  and 


Ul?  ~~ TT  f  f(f)g(f)dt=0. 

x^b-  g(x)  Jx 


Hint:  Lef  ^(x)  =  f  f{t)  dt,  integrate  by  parts,  and  use  Exercise  3.4.6. 

25.  Find  all  values  of  p  for  which  the  integral  converges  (i)  absolutely;  (ii)  condition- 
ally. 

p  1  rl  poo 

(a)  /    xp  sin  1  fx  dx        (b)  /    \logx\p  dx  (c)  /     jcp  cos(logx)  dx 

Jo  Jo  Ji 

/oo  poo 
(logx)p  dx         (e)  y  sinjc^t/x 

26.  Let  Mi  be  positive  and  satisfy  the  differential  equation 

u"  +  p(x)u  =  0,    0  <  x  <  oo.  (A) 

(a)    Prove:  If 

r°°  d. 

Jo  uj{ 

dt 


then  the  function 

u2(x)  =  u\{x) 


dx 

<  OO, 


Jx 


.(0 

also  satisfies  (A),  while  if 


then  the  function 

"x  dt 

U2(x)  =  Ui(x)  I  —j— 


r°°  dx 
Jo  u\(x) 

rx  d 
"1W  /  —, 

Jo  u\( 


m 

also  satisfies  (A). 

(b)    Prove:  If  (A)  has  a  solution  that  is  positive  on  [0,  oo),  then  (A)  has  solutions 
yi  and  y2  that  are  positive  on  (0,  oo)  and  have  the  following  properties: 

yi(x)y'2(x)  -y'l(x)y2{x)  =  1,    x  >  0, 
>i(*) 


.yz(x) 


<    0,    x  >  0, 


and 

lim 

yi{x) 


V  A 

lim  — - —  =  0. 
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27.  (a)   Prove:  If  h  is  continuous  on  [0,  oo),  then  the  function 

u(x)  =  c\e~x  +  c2ex  +  I    h(t)  sinh(x  -  t)  dt 
Jo 

satisfies  the  differential  equation 

u"  —  u  =  h(x),     x  >  0. 

(b)  Rewrite  u  in  the  form 

u(x)  =  a(x)e~x  +  b(x)ex 

and  show  that 

u'(x)  =  -a(x)e~x  +  b{x)ex . 

(c)  Show  that  if  lirn^oo  a(x)  =  A  (finite),  then 

lim  e2x  [b(x)-B]  =  0 

x— >oo 

for  some  constant  B.  Hint:  Use  Exercise  3.4.24.  Show  also  that 
lim  ex  [u(x)  -  Ae~x  -  Bex]  =  0. 

x— *oo 

(d)  Prove:  If  lim*-^  b(x)  =  B  (finite),  then 

lim  u(x)e~x  =  lim  u'(x)e~x  =  B. 

x— *oo  x— *oo 

HINT:  Use  Exercise  3.4.23. 

28.  Suppose  that  the  differential  equation 

u"  +  p(x)u  =  0  (A) 

has  a  positive  solution  on  [0,  oo),  and  therefore  has  two  solutions  y\  and  y2  with  the 
properties  given  in  Exercise  3.4.26(b). 

(a)   Prove:  If  h  is  continuous  on  [0,  oo)  and  c\  and  c2  are  constants,  then 


f 

Jo 


u(x)  =  cui(x)  +  c2y2(x)  +  I    h(t)  [yi{t)y2{x)  -  yi(x)y2(t)]  dt  (B) 

satisfies  the  differential  equation 

u"  +  p(x)u  =  h(x). 
For  convenience  in  (b)  and  (c),  rewrite  (B)  as 

u(x)  =  a(x)yi(x)  +  b(x)y2(x). 
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(b)  Prove:  If  /0°°  h{i)y2{i)  dt  converges,  then /0°°  h{t)y\{t)  dt  converges,  and 

u(x)  -  Ayi(x)  -  By2(x) 
lun   —  =  0 

*^oo  yi(x) 

for  some  constants  A  and  B.  Hint:  Use  Exercise  3.4.24  with  f  =  hy2  and 

g  =  yi/y%- 

(c)  Prove:  If  /0°°  h{t)y\(t)  dt  converges,  then 

,.  u(x) 

hm  — —  =  B 

yi(x) 


for  some  constant  B.  Hint:  Use  Exercise  3.4.23  with  f  =  hy\  and  g  = 

yi/yi- 

29.  Suppose  that  /,  f\,  and  g  are  continuous,  /  >  0,  and  (f\/ /)'  is  absolutely  inte- 
grable  on  [a,b).  Show  that  f  f\{x)g(x)  dx  converges  if  f  f  (x)g(x)  dx  does. 

30.  Let  g  be  locally  integrable  and  /  continuous,  with  f(x)  >  p  >  0  on  [a,  b).  Sup- 
pose that  for  some  positive  M  and  for  every  r  in  [a,  b)  there  are  points  x\  and  x2 
such  that  (a)  r  <  x\  <  x2  <  b;  (b)  g  does  not  change  sign  in  [xi,X2];  and 
(c)  \g(x)\ dx  >  M.  Show  that  fa  f(x)g(pc)dx  diverges.  Hint:  Use  Exer- 
cise 3.4.7  and  Theorem  3.3.7. 


3.5  A  MORE  ADVANCED  LOOK  AT  THE  EXISTENCE  OF 
THE  PROPER  RIEMANN  INTEGRAL 


In  Section  3.2  we  found  necessary  and  sufficient  conditions  for  existence  of  the  proper 
Riemann  integral,  and  in  Section  3.3  we  used  them  to  study  the  properties  of  the  integral. 
However,  it  is  awkward  to  apply  these  conditions  to  a  specific  function  and  determine 
whether  it  is  integrable,  since  they  require  computations  of  upper  and  lower  sums  and 
upper  and  lower  integrals,  which  may  be  difficult.  The  main  result  of  this  section  is  an 
integrability  criterion  due  to  Lebesgue  that  does  not  require  computation,  but  has  to  do 
with  how  badly  discontinuous  a  function  may  be  and  still  be  integrable. 

We  emphasize  that  we  are  again  considering  proper  integrals  of  bounded  functions  on 
finite  intervals. 

Definition  3.5.1  If  /  is  bounded  on  [a,  b],  the  oscillation  of  f  on  [a,  b]  is  defined  by 
Wf[a,b]=     sup  \f(x)-f(x% 

a<x,x' <b 

which  can  also  be  written  as 

Wf[a,b]=    sup  /(*)-    inf  f(x) 

a<x<b  a<x<b 
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(  Exercise  3.5.1).  If  a  <  x  <  b,  the  oscillation  of  f  at  x  is  defined  by 

w  f(x)  =  lim  Wf(x  —  h,  x  +  h). 

The  corresponding  definitions  for  x  =  a  and  x  =  b  are 

w  f(a)  =   lim  Wf(a,a  +  h)    and     Wf(b)=   lim  Wf(b  —  h,  b).  ■ 
h^o+  h^o+ 

For  a  fixed  x  in  (a,  b),  W/(x  —  h,  x  +  h)  is  a  nonnegative  and  nondecreasing  function 
of  h  for  0  <  h  <  min(x  —  a,b  —  x);  therefore,  Wf(x)  exists  and  is  nonnegative,  by 
Theorem  2.1.9.  Similar  arguments  apply  to  if /(a)  and  w f{b). 

Theorem  3.5.2  Let  f  be  defined  on  [a,b\.  Then  f  is  continuous  at  Xo  in  [a,b]  if 
and  only  if  w  f(xo)  =  0.  (Continuity  at  a  or  b  means  continuity  from  the  right  or  left, 
respectively.) 

Proof    Suppose  that  a  <  xo  <  b.  First,  suppose  that  w  f(xo)  =  0  and  e  >  0.  Then 

Wf  [xq  —  h,  xo  +  h]  <  e 

for  some  h  >  0,  so 

\f(x)  —  f{x')\  <  e    if    xo  —  h  <  x,  x'  <  xo  +  h. 

Letting  x'  =  xo,  we  conclude  that 

\f(x)- f(x0)\  <€    if  \x-x0\<h. 

Therefore,  /  is  continuous  at  xo. 

Conversely,  if  /  is  continuous  at  xo  and  e  >  0,  there  is  a  S  >  0  such  that 

|/(*)-/(*o)l  <\    and    | /(*')- /(*o)  I  <\ 
if  xo  —  S  <  x,  x'  <  xo  +  S.  From  the  triangle  inequality, 

l/W  -  f{x')\  <  \f(x)  -  f(x0)\  +  I/O')  -  f(x0)\  <  €. 

so 

Wf[xo—h,xo  +  h]<e    if   h  <  8: 
therefore,  w  f(xo)  =  0.  Similar  arguments  apply  if  xo  =  a  or  xo  =  b.  TJ 

Lemma  3.5.3  If  w/(x)  <  e  for  a  <  x  <  b,  then  there  is  a  &  >  0  such  that 
Wf[a\,  b\]  <  e,  provided  that  [a\,b\]  C  [a,  b]  and  b\  —  ci\  <  8. 

Proof  We  use  the  Heine-Borel  theorem  (Theorem  1.3.7).  If  Wf(x)  <  e,  there  is  an 
hx  >  0  such  that 

\f(x')-f(x")\<e  (3.5.1) 
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if 

x  -2hx  <  x',x"  <  x  +  2hx    and    x',x"  £  [a,b].  (3.5.2) 
If  Ix  =  (x  —  hx,  x  +  hx),  then  the  collection 

X  =  {h  \  a  <  x  <  b) 

is  an  open  covering  of  [a,  b],  so  the  Heine-Borel  theorem  implies  that  there  are  finitely 
many  points  x\,  x-i,       xn  in  [a,  b]  such  that  IX1,  IX2, IXn  cover  [a,  b].  Let 

h  =  min  hXi 

\<i<n 

and  suppose  that  [a i ,  i ]  C  [a,b]  and  b\  —  a\  <  h.  If  x'  and  x"  are  in  [a\,b\],  then 
x'  e  IXl.  for  some  r  (1  <  r  <  n),  so 

\x'  -  xr  \  <hXr. 

Therefore, 

|jc"  -  xr\  <  |jc"  -  x'\  +  \x'  -  xr\  <bi-ai  +  hXr 
<    h  +  hXr  <2hXr. 

Thus,  any  two  points  x'  and  x"  in  [a\,  b\]  satisfy  (3.5.2)  with  x  =  xr,  so  they  also  satisfy 
(3.5.1).  Therefore,  e  is  an  upper  bound  for  the  set 

{\f(x')-nx")\\x',x"e[a1,b1]}, 

which  has  the  supremum  W/[ai,  b\\.  Hence,  Wf[a\,  b\]  <  e,  H 
In  the  following,  L(I)  is  the  length  of  the  interval  /. 

Lemma  3.5.4  Let  f  be  bounded  on  [a ,  b]  and  define 

Ep  =  {x  e  [a,  b]  |  w /(x)  >  p} . 

Then  Ep  is  closed,  and  f  is  integrable  on  [a,  b]  if  and  only  if  for  every  pair  of  positive 
numbers  p  and  8,  Ep  can  be  covered  by  finitely  many  open  intervals  I\,  1%,. . . ,  Ip  such 
that 

p 

J2L(Ij)<S.  (3.5.3) 
./=i 

Proof  We  first  show  that  Ep  is  closed.  Suppose  that  xq  is  a  limit  point  of  Ep.  If  h  >  0, 
there  is  an  x  from  Ep  in  (xo  —  h,  xo  +  h).  Since  [x  —  hi,  x  +  hi]  C  [xo  —  h,  xq  +  h]  for 
sufficiently  small  h\  and  Wf  [x  —  hi,  x  +  h\]  >  p,  it  follows  that  Wf[xo  —  h,  xo  +  h]  >  p 
for  all  h  >  0.  This  implies  that  xo  6  Ep,  so  Ep  is  closed  (Corollary  1.3.6). 

Now  we  will  show  that  the  stated  condition  in  necessary  for  integrability.  Suppose  that 
the  condition  is  not  satisfied;  that  is,  there  is  a  p  >  0  and  a  S  >  0  such  that 

p 

£L(/;)><5 
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for  every  finite  set  {7i,  1%, . . . ,  Ip}  of  open  intervals  covering  Ep.  If  P  =  {xq,  Xi,  ...  ,Xn} 
is  a  partition  of  [a,  b],  then 

s(P)  - s(P)  =  j2(M.i  - m.i)(xJ  - xj-i)  +       - mjXxJ  - XJ~^'  (3-5A) 

jeA  j€B 

where 

A  =  {j  |  [xj-i,xj]  n  Ep  ^  0}     and    B  =  {j  \  [xj-i,xj]  nEp  =  0}. 

Since  U/ex(3Cy-i'  J<:i)  contains  all  points  of  £p  except  any  of  Xo,  Xi,  Xn  that  may 
be  in  Ep,  and  each  of  these  finitely  many  possible  exceptions  can  be  covered  by  an  open 
interval  of  length  as  small  as  we  please,  our  assumption  on  Ep  implies  that 

-Xj-i)  >  8. 

j£A 


Moreover,  if  j  &  A,  then 
so  (3.5.4)  implies  that 


Mj  -  nij  >  p, 


S(P)  -  s(P)  >  p  ~  xj-i)  > 

jeA 

Since  this  holds  for  every  partition  of  [a,  b],  f  is  not  integrable  on  [a,b],  by  Theorem  3.2.7. 
This  proves  that  the  stated  condition  is  necessary  for  integrability. 

For  sufficiency,  let  p  and  S  be  positive  numbers  and  let  /i,  1 2,  ■  ■  ■ ,  lp  be  open  intervals 
that  cover  Ep  and  satisfy  (3.5.3).  Let 

7 j  =  [a,b]  n7y. 

(/  j  =  closure  of  /  .)  After  combining  any  of  I \,  1 2,  . . . ,  I p  that  overlap,  we  obtain  a  set 
of  pairwise  disjoint  closed  subintervals 

Cj=[ctj,Pj\,  l<j<q(<p), 

of  [a,  b]  such  that 

a  <  ai  <  Pi  <  a2  <  fi2  ■  ■  ■  <  ctq-i  <  Pq-l  <  0Lq  <  pq  <  b,  (3.5.5) 
i 

£(0i-Oi)<«  (3.5.6) 

!  =  1 

and 

io/(x)  <  p,    P/  <  x  <  ay+i,     1  <  /  <  ^  —  1. 
Also,  w  f{x)  <  p  for  a  <  x  <  a\  \f  a  <  a\  and  for  Pq  <  x  <  b  if  Pq  <  b. 
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Let  Po  be  the  partition  of  [a,  b]  with  the  partition  points  indicated  in  (3.5.5),  and  refine 
Po  by  partitioning  each  subinterval  [Bj, a/+i]  (as  well  as  if  a  <  ai  and  [fiq.b] 

if  f)q  <  b)  into  subintervals  on  which  the  oscillation  of  /  is  not  greater  than  p.  This  is 
possible  by  Lemma  3.5.3.  In  this  way,  after  renaming  the  entire  collection  of  partition 

points,  we  obtain  a  partition  P  =  {xq,  x\  xn}  of  [a,  b]  for  which  S(P)  —  s(P)  can  be 

written  as  in  (3.5.4),  with 

<? 

jeA  (=1 


(see  (3.5.6))  and 


For  this  partition, 


Mj  —  mj  <  p,    j  e  B. 


Y  (Mj  -  mj){xj  -  xj-i)  <2KY^  (*j  ~  Xj-i)  <  2K^, 
j^A  jeA 

where  K  is  an  upper  bound  for  \  f\  on  [a ,  b]  and 

(Mj  -mj)(xj  -xj-i)  <  p(b-a). 

We  have  now  shown  that  if  p  and  S  are  arbitrary  positive  numbers,  there  is  a  partition  P  of 
[a,  b]  such  that 

S(P)-s(P)  <2K8  +  p(b-a).  (3.5.7) 


If€  >  0,  let 


Then  (3.5.7)  yields 


8  =    and    p  = 


4K  r  2(b-a) 


S(P)-s(P)  <  e, 

and  Theorem  3.2.7  implies  that  /  is  integrable  on  [a,  b].  TJ 
We  need  the  next  definition  to  state  Lebesgue's  integrability  condition. 


Definition  3.5.5  A  subset  S  of  the  real  line  is  of  Lebesgue  measure  zero  if  for  every 
e  >  0  there  is  a  finite  or  infinite  sequence  of  open  intervals  I\ ,  1%, . . ,  such  that 

S  C  (J  Ij  (3.5.8) 

j 

and 

n 

J2L(Ij)<e>  n  >  L  (3-5-9) 
J=1 
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Note  that  any  subset  of  a  set  of  Lebesgue  measure  zero  is  also  of  Lebesgue  measure  zero. 
(Why?) 

Example  3.5.1  The  empty  set  is  of  Lebesgue  measure  zero,  since  it  is  contained  in 
any  open  interval. 

Example  3.5.2  Any  finite  set  S  =  {xi,X2,  ...,Xn}  is  of  Lebesgue  measure  zero, 
since  we  can  choose  open  intervals  I\,  I2,  In  such  that  Xj  €  /;  and  L(I j)  <  e/n, 
1  <  j  <  n. 

Example  3.5.3  An  infinite  set  is  denumerable  if  its  members  can  be  listed  in  a  se- 
quence (that  is,  in  a  one-to-one  correspondence  with  the  positive  integers);  thus, 

S  =  {xi,x2,...,x„,...}.  (3.5.10) 

An  infinite  set  that  does  not  have  this  property  is  nondenumerable .  Any  denumerable  set 
(3.5.10)  is  of  Lebesgue  measure  zero,  since  if  e  >  0,  it  is  possible  to  choose  open  intervals 
I\,  I2, so  that  Xj  e  Ij  and  L(Ij  )  <  2~'e,  j  >  1.  Then  (3.5.9)  holds  because 

111  1  1 

— I — —  H — +  •  -  •  H  =  1  <  1.  (3.5.11) 

2     22      23  2"  2" 

■ 

There  are  also  nondenumerable  sets  of  Lebesgue  measure  zero,  but  it  is  beyond  the  scope 
of  this  book  to  discuss  examples. 

The  next  theorem  is  the  main  result  of  this  section. 

Theorem  3.5.6  A  bounded  function  f  is  integrable  on  a  finite  interval  [a,b]  if  and 
only  if  the  set  S  of  discontinuities  of  f  in  [a,  b]  is  of  Lebesgue  measure  zero. 

Proof   From  Theorem  3.5.2, 

S  =  {x  e  [a,  b]  I  w f(x)  >  0}. 

Since  w  f(x)  >  0  if  and  only  if  w /(x)  >  1/  i  for  some  positive  integer  i,  we  can  write 

00 

S=ySi,  (3.5.12) 

!  =  1 

where 

Sj  =  {x  e  [a,  b]  |  w f(x)  >  1/z'} . 

Now  suppose  that  /  is  integrable  on  [a,  b]  and  e  >  0.  From  Lemma  3.5.4,  each  Si  can 
be  covered  by  a  finite  number  of  open  intervals  In,  1,2,  •  •  • ,  hn  of  total  length  less  than 
e/2l .  We  simply  renumber  these  intervals  consecutively;  thus, 

h,  1%,  •  ■  •  =  h\,  ■■  ■ ,  hm,  Hi,  ■  ■  ■ ,  l2n2>  ■  ■  ■  >  hi,  ■  ■  ■ ,  Iint,  

Now  (3.5.8)  and  (3.5.9)  hold  because  of  (3.5.1 1)  and  (3.5.12),  and  we  have  shown  that  the 
stated  condition  is  necessary  for  integrability. 
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For  sufficiency,  suppose  that  the  stated  condition  holds  and  e  >  0.  Then  5  can  be 
covered  by  open  intervals  I\,  1%, . . .  that  satisfy  (3.5.9).  If  p  >  0,  then  the  set 

Ep  =  {x  €  [a,  b]  |  w/(x)  >  p} 

of  Lemma  3.5.4  is  contained  in  S  (Theorem  3.5.2),  and  therefore  Ep  is  covered  by  I\,  1%, . . . 
Since  Ep  is  closed  (Lemma  3.5.4)  and  bounded,  the  Heine-Borel  theorem  implies  that  Ep 

is  covered  by  a  finite  number  of  intervals  from  I\ ,  I2,  The  sum  of  the  lengths  of  the 

latter  is  less  than  e,  so  Lemma  3.5.4  implies  that  /  is  integrable  on  [a,  b\.  H 

3.5  Exercises 


1 .  In  connection  with  Definition  3 . 5 . 1 ,  show  that 

sup     |/(x)  -  f(x')\  =   sup  f(x)-   inf  f(x). 

x,x'€[a,b]  a<x<b  a<x<b 

2.  Use  Theorem  3.5.6  to  show  that  if  /  is  integrable  on  [a,  b],  then  so  is  \  f\  and,  if 
f(x)  >  p  >  0  (a  <  x  <  b),  so  is  1//. 

3.  Prove:  The  union  of  two  sets  of  Lebesgue  measure  zero  is  of  Lebesgue  measure 
zero. 

4.  Use  Theorem  3.5.6  and  Exercise  3.5.3  to  show  that  if  /  and  g  are  integrable  on 
[a,  b],  then  so  are  f  +  g  and  fg. 

5.  Suppose  /  is  integrable  on  [a,  b],  a  =  infa<^<fe  f(x),  and  p1  =  supa<x<fe  f(x). 
Let  g  be  continuous  on  [a,  /}].  Show  that  the  composition  h  =  g  o  f  is  integrable 
on  [a,  b]. 

6.  Let  /  be  integrable  on  [a,  b],  let  a  =  infa<x<6  / (x)  and  f)  =  supa<;c<fe  / (x),  and 
suppose  that  G  is  continuous  on  [a,  {}].  For  each  n  >  1,  let 


(j  —  l)(b  —  a)                       ,  j(b-a)      .  . 
a  H  <  Ujn,  Vjn  <  a  H  ,     1  <  j  <  n. 


Show  that 


lim  -  Y  \G{f{ujn))  ~  G(f(vJn))\  =  0. 


7=1 

7.  Let  /j(x)  =  0  for  all  x  in  [a,  b]  except  for  x  in  a  set  of  Lebesgue  measure  zero. 
Show  that  if  f  h(x)  dx  exists,  it  equals  zero.  Hint:  Any  subset  of  a  set  of  measure 
zero  is  also  of  measure  zero. 

8.  Suppose  that  /  and  g  are  integrable  on  [a,  b]  and  / (x)  =  g(x)  except  for  x  in  a  set 
of  Lebesgue  measure  zero.  Show  that 

»b  Pb 


I    f(x)dx=  I  g(x)dx. 

J  a  J  a 


CHAPTER  4 
Infinite  Sequences  and  Series 


IN  THIS  CHAPTER  we  consider  infinite  sequences  and  series  of  constants  and  functions 
of  a  real  variable. 

SECTION  4.1  introduces  infinite  sequences  of  real  numbers.  The  concept  of  a  limit  of  a 
sequence  is  defined,  as  is  the  concept  of  divergence  of  a  sequence  to  ±00.  We  discuss 
bounded  sequences  and  mono  tonic  sequences.  The  limit  inferior  and  limit  superior  of  a 
sequence  are  defined.  We  prove  the  Cauchy  convergence  criterion  for  sequences  of  real 
numbers. 

SECTION  4.2  defines  a  subsequence  of  an  infinite  sequence.  We  show  that  if  a  sequence 
converges  to  a  limit  or  diverges  to  ±00,  then  so  do  all  subsequences  of  the  sequence.  Limit 
points  and  boundedness  of  a  set  of  real  numbers  are  discussed  in  terms  of  sequences  of 
members  of  the  set.  Continuity  and  boundedness  of  a  function  are  discussed  in  terms  of  the 
values  of  the  function  at  sequences  of  points  in  its  domain. 

SECTION  4.3  introduces  concepts  of  convergence  and  divergence  to  ±00  for  infinite  series 
of  constants.  We  prove  Cauchy 's  convergence  criterion  for  a  series  of  constants.  In  con- 
nection with  series  of  positive  terms,  we  consider  the  comparison  test,  the  integral  test,  the 
ratio  test,  and  Raabe's  test.  For  general  series,  we  consider  absolute  and  conditional  con- 
vergence, Dirichlet's  test,  rearrangement  of  terms,  and  multiplication  of  one  infinite  series 
by  another. 

SECTION  4.4  deals  with  pointwise  and  uniform  convergence  of  sequences  and  series  of 
functions.  Cauchy 's  uniform  convergence  criteria  for  sequences  and  series  are  proved,  as 
is  Dirichlet's  test  for  uniform  convergence  of  a  series.  We  give  sufficient  conditions  for 
the  limit  of  a  sequence  of  functions  or  the  sum  of  an  infinite  series  of  functions  to  be 
continuous,  integrable,  or  differentiable. 

SECTION  4.5  considers  power  series.  It  is  shown  that  a  power  series  that  converges  on 
an  open  interval  defines  an  infinitely  differentiable  function  on  that  interval.  We  define 
the  Taylor  series  of  an  infinitely  differentiable  function,  and  give  sufficient  conditions  for 
the  Taylor  series  to  converge  to  the  function  on  some  interval.  Arithmetic  operations  with 
power  series  are  discussed. 
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4.1  SEQUENCES  OF  REAL  NUMBERS 


An  infinite  sequence  (more  briefly,  a  sequence)  of  real  numbers  is  a  real-valued  function 
defined  on  a  set  of  integers  We  call  the  values  of  the  function  the  terms  of  the 

sequence.  We  denote  a  sequence  by  listing  its  terms  in  order;  thus, 


{s„}f  =  {sk,sk+l,...}.  (4.1.1) 


For  example, 


-u.1  1 


and 


n2  +  1 )  0       (     2  5        n2  +  1 

{(-in;r= {i.-i,!,...  .(-if,...}, 


=  <i,U 

'23 


The  real  number  s„  is  the  nth  term  of  the  sequence.  Usually  we  are  interested  only  in  the 
terms  of  a  sequence  and  the  order  in  which  they  appear,  but  not  in  the  particular  value  of  k 
in  (4.1.1).  Therefore,  we  regard  the  sequences 

1 


\  oo 

,  n-2 
as  identical. 

We  will  usually  write  {sn}  rather  than  In  the  absence  of  any  indication  to  the 

contrary,  we  take  k  =  0  unless  sn  is  given  by  a  rule  that  is  invalid  for  some  nonnegative 
integer,  in  which  case  k  is  understood  to  be  the  smallest  positive  integer  such  that  sn  is 
defined  for  all  n  >  k.  For  example,  if 

1 


(«-1)(m-5) 

then  k  =  6. 

The  interesting  questions  about  a  sequence  {sn}  concern  the  behavior  of  sn  for  large  n. 
Limit  of  a  Sequence 

Definition  4.1.1  A  sequence  {s„}  converges  to  a  limit  s  if  for  every  e  >  0  there  is  an 
integer  N  such  that 

\sn-s\<e    if    n  >  N.  (4.1.2) 
In  this  case  we  say  that  {sn}  is  convergent  and  write 

lim  s„  =  s. 

n— >oo 

A  sequence  that  does  not  converge  diverges,  or  is  divergent  ■ 
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As  we  saw  in  Section  2.1  when  discussing  limits  of  functions,  Definition  4.1.1  is  not 
changed  by  replacing  (4.1.2)  with 

\s„  -s\  <  Ke    if   n  >  N, 

where  K  is  a  positive  constant. 


Example  4.1.1  If  sn  =  c  for n  >  k,  then  \s„  —  c\  =  0  forn  >  k,  and  limn^oo  sn  =  c. 


Example  4.1.2  If 


then  lim„^oo  s„  =  2,  since 


In  +  1     2«  +  2 


n  +  1       n  +  1 


«  +  1 


hence,  if  e  >  0,  then  (4.1.2)  holds  with s  =  2if  N  >  l/e. 


Definition  4.1.1  does  not  require  that  there  be  an  integer  N  such  that  (4.1.2)  holds  for 
all  e;  rather,  it  requires  that  for  each  positive  e  there  be  an  integer  N  that  satisfies  (4.1.2) 
for  that  particular  e.  Usually,  N  depends  on  e  and  must  be  increased  if  e  is  decreased.  The 
constant  sequences  (Example  4.1.1)  are  essentially  the  only  ones  for  which  N  does  not 
depend  on  e  (Exercise  4.1.5). 

We  say  that  the  terms  of  a  sequence  {sn  }£°  satisfy  a  given  condition/or  all  n  if  s„  satisfies 
the  condition  for  all  n  >  k,  or  for  large  n  if  there  is  an  integer  N  >  k  such  that  sn  satisfies 
the  condition  whenever  n  >  N.  For  example,  the  terms  of  {l/«}^°  are  positive  for  all  n, 
while  those  of  { 1  —  7/«}j°  are  positive  for  large  n  (take  N  =  8). 

Uniqueness  of  the  Limit 

Theorem  4.1.2  The  limit  of  a  convergent  sequence  is  unique. 
Proof    Suppose  that 

lim  sn  =  s    and      lim  sn  =  s' '. 

n— >oo  n— >oo 

We  must  show  that  s  =  s'.  Let  e  >  0.  From  Definition  4.1.1,  there  are  integers  Ni  and  N2 
such  that 

\sn  -  s\  <  e    if   n  >  Ni 


(because  lim„^oo  s„  =  s),  and 


\s„  -  s'\  <  e    if   n  >  N2 
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(because  lim„^oo  sn  =  s').  These  inequalities  both  hold  if  n  >  N  =  max(A^i,  N2),  which 
implies  that 

=  \(s  -  sN)  +  (sN  -  s')\ 

<  |j  —  sn\  +  \sn  —  s'\  <  e  +  e  =  2e. 

Since  this  inequality  holds  for  every  e  >  0  and  \s  —  s'\  is  independent  of  e,  we  conclude 
that  \s  —  s'\  =  0;  that  is,  s  =  s'.  TJ 

Sequences  Diverging  to  ±00 
We  say  that 

lim  s„  =  00 

n— >oo 

if  for  any  real  number  a,  sn  >  a  for  large  n.  Similarly, 

lim  s„  =  —00 

n— >oo 

if  for  any  real  number  a,  s„  <  a  for  large  n.  However,  we  do  not  regard  {sn}  as  convergent 
unless  lim„^oo  s„  is  finite,  as  required  by  Definition  4.1.1.  To  emphasize  this  distinction, 
we  say  that  {sn}  diverges  to  00  (—00)  if  limn^oo  sn  =  00  (—00). 

Example  4.1.3  The  sequence  {n/2+  l/n}  diverges  to  00,  since,  if  a  is  any  real  num- 
ber, then 

n  1 

— I —  >  a    if    n  >  2a. 
2  n 

The  sequence  {n  —  n2}  diverges  to  —00,  since,  if  a  is  any  real  number,  then 
— n2  +  n  =  —n(n  —  1)  <  a    if   n  >  1  +  \f\a\. 

Therefore,  we  write 


and 

lim  (— n2  +  n)  =  —00. 
The  sequence  {(— l)"n3}  diverges,  but  not  to  —00  or  00. 

Bounded  Sequences 

Definition  4.1.3  A  sequence  {sn}  is  bounded  above  if  there  is  a  real  number  b  such 
that 

s„  <  b    for  all  n , 
bounded  below  if  there  is  a  real  number  a  such  that 

sn>  a    for  all  n , 

or  bounded  if  there  is  a  real  number  r  such  that 


\sn\  <  r    for  all  n. 
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Example  4.1.4  If  s„  =  [1  +  {—l)"]n,  then  {s„}  is  bounded  below  (s„  >  0)  but 
unbounded  above,  and  {— sn}  is  bounded  above  (— s„  <  0)  but  unbounded  below.  If  s„  = 
(— 1)",  then  {s„}  is  bounded.  If  sn  =  (—  \)nn,  then  {sn}  is  not  bounded  above  or  below. 

Theorem  4.1.4  A  convergent  sequence  is  bounded. 

Proof  By  taking  e  =  1  in  (4.1.2),  we  see  that  if  lim„^.oo  sn  =  s,  then  there  is  an  integer 
N  such  that 

\sn-s\<l    if   n  >  N. 

Therefore, 

k»|  =  |(s»-*)  +  *|<|*ii-*|  +  |*|<l  +  M    if  n>N, 

and 

\sn\  <  max{|s0|,  kiV-il,  1  +  \s\} 

for  all  n,  so  {sn}  is  bounded.  TJ 

Monotonic  Sequences 

Definition  4.1.5  A  sequence  {s„}  is  nondecreasing  if  sn  >  i„_i  for  all  «,  or  nonin- 
creasing  if  5„  <         for  all  n.  A  monotonic  sequence  is  a  sequence  that  is  either  nonin- 
creasing  or  nondecreasing.  If^„  >  sn-\  for  all  n,  then  {sn}  is  increasing,  while  if  i„  < 
for  all  n,  {sn}  is  decreasing. 

Theorem  4.1.6 

(a)  If{sn}  is  nondecreasing,  then  linin^oo  sn  =  sup{i„}. 

(b)  If{sn}  is  nonincreasing ,  then  lim„^oo  sn  =  inf{^„}. 

Proof    (a).  Let    =  sup{*„}.  If    <  oo,  Theorem  1.1.3  implies  that  if  e  >  0  then 

P  -  €  <  sN  <  P 

for  some  integer  N.  Since  sn  <  sn  <  P  if  «  >  iV ,  it  follows  that 

P  -  €  <  sn  <  P    if   n>  N, 

This  implies  that  | i„ —  j6 1  <eifn  >  N,  so  lim„^oo  i„  =  /},  by  Definition  4. 1.1.  If  P  =  oo 
and  b  is  any  real  number,  then  sn  >  b  for  some  integer  N .  Then  sn  >  b  for  «  >  N,  so 
lim„^oo  s„  =  oo. 

We  leave  the  proof  of  (b)  to  you  (Exercise  4.1.8)  H 

Example  4.1.5  Ifso  =  1  ands„  =  1—  e~s"~l,  thenO  <  s„  <  1  for  all  n,  by  induction. 
Since 

sn+i  -sn  =  -(e~s"  -  e5"-1)    if   n  >  1, 
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the  mean  value  theorem  (Theorem  2.3.1 1)  implies  that 

sn+i -s„  =  e-r"(s„ -sH-i)    if   n>\,  (4.1.3) 

where  tn  is  between  sn- \  and  sn .  Since  s\  —  sq  =  —  l/e  <  0,  it  follows  by  induction  from 
(4.1.3)  that  sn+i  —  s„  <  0  for  all  n.  Hence,  {sn}  is  bounded  and  decreasing,  and  therefore 
convergent. 


Sequences  of  Functional  Values 

The  next  theorem  enables  us  to  apply  the  theory  of  limits  developed  in  Section  2.1  to  some 
sequences.  We  leave  the  proof  to  you  (Exercise  4.1.13). 

Theorem  4.1.7  Let  limx^.oo  f  (x)  =  L,  where  L  is  in  the  extended  reals,  and  suppose 
that  sn  =  f{n)  for  large  n .  Then 

lim  sn  =  L. 

n— >oo 


Example  4.1.6  Let 


s„  =    and    j(x)  =   . 


By  L'Hospital's  rule, 

log  x  \jx 
lim    =  lim           =  0. 

x— >oo     x  x~ 1 

Hence,  limn-xx,  log«/«  =  0. 


Example  4.1.7  LeU„  =  (1  +  l/n)n  and 

f(x)  IV  =  e^og(l+lM) 


By  L'Hospital's  rule, 


f.,l\      ,.     log(l  +  l/x) 
lim  x  log  (  1  H —  I  —  lim   ;  

x->oo  y         X  J       x->oo  l/x 

1  1 


x2l  +  l/x  , 
=  lim   r—^1 —  =  1; 

x-*oa  —l/X2 


hence, 


lim 

x— >oo 


(  1  +  I  )    =  e1  =  e    and     lim  (  1  +  -  )    =  e. 

\       x  J  «->oo  y       n ) 


The  last  equation  is  sometimes  used  to  define  e. 
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Example  4.1.8  Suppose  that  s„  =  p"  withp  >  0,  and  let  f(x)  =  px  =  exl°BP.  Since 

[0,      iflogp<0  (0<p<l), 
lim  exlosP  =  \  1,      iflogp  =  0    (p  =  1), 
loo,    if  logp  >  0    (p  >  1), 


it  follows  that 


Therefore, 


[0,  0<p<l, 
lim  pn  =  il,  p=l, 
loo,    p  >  1. 


lim  rn  =  {\,     r  =  1 


0,  -1  <  r  <  1, 

1,  r  =  1, 
oo,    r  >  1, 


a  result  that  we  will  use  often. 


A  Useful  Limit  Theorem 

The  next  theorem  enables  us  to  investigate  convergence  of  sequences  by  examining  simpler 
sequences.  It  is  analogous  to  Theorem  2.1.4. 

Theorem  4.1.8  Let 

lim  s„  =  s    and     lim  tn  =  t,  (4.1.4) 

n— »oo  n — >oo 

where  s  and  t  are  finite.  Then 

lim  (cs„)  =  cs  (4.1.5) 

n— >oo 

if  c  is  a  constant, 

lim  (s„  +  t„)  =  s  +  t,  (4.1.6) 

n— >oo 

lim  (s„  -  tn)  =  s  -  t,  (4.1.7) 

n— >oo 

lim  (s„t»)  =  st,  (4.1.8) 

n— too 

and 

s„  s 

lim  —  =  -  (4.1.9) 

«->oo  t„  t 

if  t„  is  nonzero  for  all  n  and  t  ^  0. 

Proof  We  prove  (4.1.8)  and  (4.1.9)  and  leave  the  rest  to  you  (Exercises  4.1.15  and 
4.1.17).  For  (4.1.8),  we  write 
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hence, 

\s„t„-st\  <  \s„-s\  \t„\  +  \s\  \t„-t\.  (4.1.10) 

Since  {tn}  converges,  it  is  bounded  (Theorem  4.1.4).  Therefore,  there  is  a  number  R  such 
that  \tn\  <  R  for  all  n,  and  (4.1.10)  implies  that 

\snt„-st\  <  R\s„-s\  +  \s\  \t„-t\.  (4.1.11) 

From  (4. 1 .4),  if  e  >  0  there  are  integers  N\  and  N2  such  that 


s  I  <  €    if   n  >  N\ 


(4.1.12) 


and 

\tn-t\<€    if   n>N2.  (4.1.13) 

If  N  =  max(Ni,N2),  then  (4.1.12)  and  (4.1.13)  both  hold  when  n  >  N,  and  (4.1.11) 
implies  that 

\sntn -st\  <  (R  +  \s\)e    if  n>N. 

This  proves  (4.1.8). 

Now  consider  (4. 1 .9)  in  the  special  case  where  sn  =  1  for  all  n  and  t  ^  0;  thus,  we  want 
to  show  that 

1  1 
lim  —  =  -. 

n^oo  tn  t 

First,  observe  that  since  limn-j.oo  f„  =t  7^  0,  there  is  an  integer  M  such  that  \tn  \  >  \t\/2 
if  n  >  M.  To  see  this,  we  apply  Definition  4.1.1  with  e  =  \t |/2;  thus,  there  is  an  integer 
M  such  that  \tn  -  t\  <  \t/2\  if  n  >  M.  Therefore, 


\tn\  =  V  +  (tn~t)\  >  \\A  ~ \tn~t\\  >  y     if     n  >  M, 


If  e  >  0,  choose  JV0  so  that  | r«  —  r |  <  e  if  n  >  ^0,  and  let     =  max(Af0,  M).  Then 
1  1 

tn  ~t 


\t-tn\  26 

  <  — if    n  >  N; 


tn  t 


t 


hence,  limn^oo  \/tn  =  \/t.  Now  we  obtain  (4.1.9)  in  the  general  case  from  (4.1.8)  with 
{tn}  replaced  by  {l/t„}.  TJ 

Example  4.1.9  To  determine  the  limit  of  the  sequence  defined  by 

1      nil  2(l+3/«) 

sn  =  —  sin  1  ■ — , 

n        4         1  +  1/m 

we  apply  the  applicable  parts  of  Theorem  4.1.8  as  follows: 

1      nn     2  [  lim  1  +  3  lim  (1/«)1 

,.  1    .    nJl    ,  Ln— »oo  n— »oo  J 

lim  sn  =  lim  —  sin  1  ;  

n^oo        «->oo  n        4  lim  1  +  lim  (l/«) 


n— >oo  n—*oo 


2(1  +  3  ■  0) 

0+  —   =  2. 

1+0 
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Example  4.1.10  Sometimes  preliminary  manipulations  are  necessary  before  applying 
Theorem  4.1.8.  For  example, 

(«/2)  +  log«  l/2+(log«)/« 
hm   — —  =  hm   — - — 

n^oo    2>n  +  \*Jn       «->-oo     3  +  4«~1</2 

liml/2+  lim  (log«)/« 


,  ~1/2 

n—>oo 

1/2  +  0 


lim  3  +  4  lim  n 

w— *oo  n— >oo 


3  +  0 

1 

6' 


(see  Example  4. 1 .6) 


Example  4.1.11  Suppose  that  —  1  <  r  <  1  and 

i0=l,    sx  =  \  +  r,    s2  =  1  +  r  +  r2,...,    s„  =  1  +  r  H  h  rn. 

Since 

i„  -  rsn  =  (1  +  r  +  •  ■  •  +  rn)  -  (r  +  r2  +  ■  •  ■  +  r"+1)  =  1  -  rn+\ 
it  follows  that 

^  _  rn+i 

Sn  =   (4.1.14) 

1  —  r 

From  Example  4.1.8,  lim„^oo  rn+1  =  0,  so  (4.1.14)  and  Theorem  4.1.8  yield 

lim(l  +  rH  h  r")  =  if    -  1  <  r  <  1.  ■ 

n — >oo  1  —  Y 

Equations  (4.1.5)-(4.1.8)  are  valid  even  if  s  and  t  are  arbitrary  extended  reals,  provided 
that  their  right  sides  are  defined  in  the  extended  reals  (Exercises  4.1.16, 4.1.18,  and  4.1.21); 
(4.1.9)  is  valid  if  s/t  is  defined  in  the  extended  reals  and  t  ^  0  (Exercise  4.1.22). 

Example  4.1.12  If  —  1  <  r  <  l,then 

n       lim  r"  n 

,.       '  n— too  u  n 

hm  —  =    =  —  =0, 

n^-oo  n\       lim  n\  oo 

n— >oo 

from  (4.1.9)  and  Example  4.1.8.  However,  if  r  >  1,  (4.1.9)  and  Example  4.1.8  yield 

n       lim  r" 

,.       '  n— too  lJU 

n-s-oo  n\       lim «!  oo 

n— >oo 

an  indeterminate  form.  If  r  <  —1,  then  liirin-^oo  r"  does  not  exist  in  the  extended  reals,  so 
(4.1.9)  is  not  applicable.  Theorem  4.1.7  does  not  help  either,  since  there  is  no  elementary 
function  /  such  that  f(n)  =  r" /«!.  However,  the  following  argument  shows  that 
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r" 

lim  —  =  0,     — oo  <  r  <  oo.  (4.1.15) 

n— >oq  ji\ 


There  is  an  integer  M  such  that 

Irl  1 

Li  <  _    if   n  >  M. 
n  2 

Let  A  =  rm/M!.  Then 

n-M 


r  r         r  r  ,  .  , 

—  <  a—  U  i_L  <  a:  -       .   «  >  m. 

n\   ~     M  +  \M  +2  n 


G) 


Given  e  >  0,  choose  /V  >  M  so  that  K/2N~M  <  e.  Then  |r|"/«!  <  e  if  n  >  7Y,  which 
verifies  (4.1.15). 


Limits  Superior  and  Inferior 

Requiring  a  sequence  to  converge  may  be  unnecessarily  restrictive  in  some  situations.  Of- 
ten, useful  results  can  be  obtained  from  assumptions  on  the  limit  superior  and  limit  inferior 
of  a  sequence,  which  we  consider  next. 

Theorem  4.1.9 

(a)  If  {s„}  is  bounded  above  and  does  not  diverge  to  — oo,  then  there  is  a  unique  real 
number's  such  that,  if  e  >  0, 

s„<J  +  e   forlargen  (4.1.16) 

and 

s„  >  J  —  e    for  infinitely  many  n.  (4.1.17) 

(b)  If  {sn}  is  bounded  below  and  does  not  diverge  to  oo,  then  there  is  a  unique  real 
number  s_  such  that,  if  e  >  0, 

s„  >  s_  —  €    forlargen  (4.1.18) 

and 

sn  <  s_  +  €    for  infinitely  many  n.  (4.1.19) 

Proof  We  will  prove  (a)  and  leave  the  proof  of  (b)  to  you  (Exercise  4.1.23).  Since 
{sn}  is  bounded  above,  there  is  a  number  /3  such  that  sn  <  P  for  all  n.  Since  {sn}  does  not 
diverge  to  — oo,  there  is  a  number  a  such  that  s„  >  a  for  infinitely  many  n.  If  we  define 


Mk  =  sxxp{sk,sk+l,...,sk+r,...}, 
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then  a  <  <  /3,  so  {M^}  is  bounded.  Since  {Mk}  is  nonincreasing  (why?),  it  converges, 
by  Theorem  4.1.6.  Let 

1=  lim  Mk.  (4.1.20) 

If  €  >  0,  then  Mk  <  J  +  e  for  large  /c,  and  since  sn  <  Af^  for  n  >  k,J  satisfies  (4.1.16). 
If  (4.1.17)  were  false  for  some  positive  e,  there  would  be  an  integer  K  such  that 

s„  <  s  —  e    if   n  >  K. 

However,  this  implies  that 

Mk  <  s  -  e    if    k  >K, 

which  contradicts  (4.1.20).  Therefore,  ~s  has  the  stated  properties. 

Now  we  must  show  that  ~s  is  the  only  real  number  with  the  stated  properties.  If  t  <  J,  the 
inequality 

s—t      _  J—t 
sn<t  +  — -=?-  — 

cannot  hold  for  all  large  n,  because  this  would  contradict  (4.1.17)  with  e  =  (s  —  t)/2.  If 
s  <  t,  the  inequality 

t  —  J     _     t  —  J 

s„  >  t  =  s  H  

2  2 

cannot  hold  for  infinitely  many  n ,  because  this  would  contradict  (4.1.16)  with  e  =  (t  —s) / 2. 
Therefore,  ~s  is  the  only  real  number  with  the  stated  properties.  H 

Definition  4.1.10  The  numbers  I  and  s_  defined  in  Theorem  4.1.9  are  called  the  limit 
superior  and  limit  inferior,  respectively,  of  {sn},  and  denoted  by 

~s  =  lim  s„    and    s  =  lim  s„ . 


We  also  define 


and 


lim  sn  =    oo  if  {sn}  is  not  bounded  above, 

n— >oo 

lim  s„  =  — oo  if  lim  s„  =  — oo, 

n— >oo  n— >cx) 

lim  sn  =  — oo  if  {^„}  is  not  bounded  below, 


lim  s„  =    oo    if  lim  s„  =  oo. 


Theorem  4.1.11  £Very  sequence  {s„  }  of  real  numbers  has  a  unique  limit  superior,  s, 
and  a  unique  limit  inferior,  §_,  in  the  extended  reals,  and 

s  <s.  (4.1.21) 
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Proof  The  existence  and  uniqueness  of  s  and  s_  follow  from  Theorem  4.1.9  and  Defini- 
tion 4.1.10.  If  I  and  5  are  both  finite,  then  (4.1.16)  and  (4.1.18)  imply  that 

s  —  e  <  s  +  e 

for  every  e  >  0,  which  implies  (4.1.21).  If  5  =  —  ooorI  =  oo,  then  (4.1.21)  is  obvious.  If 
s_=  oo  or  s  =  — oo,  then  (4.1.21)  follows  immediately  from  Definition  4.1.10.  H 


Example  4.1.13 


and 


lim  r" 


lim  r" 


OO, 

r  >  1, 

1, 

r  =  1, 

0, 

\r\  <  1, 

-1, 

r  =  -1. 

—oo, 

r  <  —I. 

Also, 


and 


lim  n2  =  lim  n2 


=  oo. 


lim  (-1) 


=  1,      lim  (-1) 


=  -i, 


lim  [1  +  (-1)"]  nz  =  oo,      lim  [1  +  (-1)"]  n2  =  0. 


Theorem  4.1.12  If{s„}  is  a  sequence  of  real  numbers,  then 


if  and  only  if 


lim  s„  =  s 

n—*oo 


lim  s„  =  lim  s„  =  s. 


(4.1.22) 
(4.1.23) 


Proof  If  s  =  ±oo,  the  equivalence  of  (4.1.22)  and  (4.1.23)  follows  immediately  from 
their  definitions.  If  lim„^oo  sn  =  s  (finite),  then  Definition  4. 1 . 1  implies  that  (4.1.16)- 
(4.1.19)  hold  with  I  and  s_  replaced  by  s.  Hence,  (4.1.23)  follows  from  the  uniqueness  of 
I  and  s_.  For  the  converse,  suppose  that  J  =  s_  and  let  s  denote  their  common  value.  Then 
(4.1.16)  and  (4.1.18)  imply  that 

s  —  €  <  s„  <  s  +  e 

for  large  n,  and  (4.1.22)  follows  from  Definition  4.1.1  and  the  uniqueness  of  limn-xx,  sn 
(Theorem  4.1.2).  TJ 
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Cauchy's  Convergence  Criterion 

To  determine  from  Definition  4.1.1  whether  a  sequence  has  a  limit,  it  is  necessary  to  guess 
what  the  limit  is.  (This  is  particularly  difficult  if  the  sequence  diverges!)  To  use  Theo- 
rem 4.1.12  for  this  purpose  requires  finding  J  and  s_.  The  following  convergence  criterion 
has  neither  of  these  defects. 

Theorem  4.1.13  (Cauchy's  Convergence  Criterion)  A  sequence  {sn}  of 
real  numbers  converges  if  and  only  if  for  every  e  >  0,  there  is  an  integer  N  such  that 

\s„-sm\<e    if   m,n>N.  (4.1.24) 

Proof  Suppose  that  lim„^oo  sn  =  s  and  e  >  0.  By  Definition  4.1.1,  there  is  an  integer 
N  such  that 

\sr  —  s\  <  -    if   r  >  N. 

Therefore, 

\sn- sm\  =  \(s„- s)  +  (s  -  sm)\  <  \s„  -  s\  + \s  -  sm\  <  e    if  n,m>N. 

Therefore,  the  stated  condition  is  necessary  for  convergence  of  {sn}.  To  see  that  it  is  suffi- 
cient, we  first  observe  that  it  implies  that  {sn}  is  bounded  (Exercise  4.1.27),  so  J  and  s_  are 
finite  (Theorem  4.1.9).  Now  suppose  that  e  >  0  and  N  satisfies  (4.1.24).  From  (4.1.16) 
and  (4.1.17), 

\s„-s\<e,  (4.1.25) 
for  some  integer  n  >  N  and,  from  (4.1.18)  and  (4.1.19), 

\sm-s}<€  (4.1.26) 

for  some  integer  m  >  N.  Since 

\s-  s\  =  \(s-s„)  +  (s„  -sm)  +  {sm  -s)\ 
<  \s  -  s„\  +  \s„  -  s„\  +  \sm  -s\, 

(4.1.24)-(4.1.26)  imply  that 

\s  —  s\  <  3e. 

Since  e  is  an  arbitrary  positive  number,  this  implies  that  ~s  =  s_,  so  {s„}  converges,  by 
Theorem  4.1.12.  H 

Example  4.1.14  Suppose  that 

\f'(x)\  <  r  <  1,    -oo  <  x  <  oo.  (4.1.27) 

Show  that  the  equation 

x  =  f{x)  (4.1.28) 

has  a  unique  solution. 
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Solution  To  see  that  (4.1.28)  cannot  have  more  than  one  solution,  suppose  that  x  = 
f(x)  and  x'  =  f(x').  From  (4.1.27)  and  the  mean  value  theorem  (Theorem  2.3.11), 

x-x'  =  f'(c)(x-x') 

for  some  c  between  x  and  x' .  This  and  (4.1.27)  imply  that 

\x  -  x'\  <  r\x  -  x'\. 

Since  r  <  1,  x  =  x' . 

We  will  now  show  that  (4.1.28)  has  a  solution.  With  xq  arbitrary,  define 

x„  =  f(xH-i),    n  >  1.  (4.1.29) 

We  will  show  that  {xn}  converges.  From  (4.1.29)  and  the  mean  value  theorem, 

xn+i  ~x„  =  f(xn)  -  f(xn-i)  =  f'(c„)(xn  -  xn-\), 

where  cn  is  between  xn-\  and  xn.  This  and  (4.1.27)  imply  that 

\xn+i -x„\  <  r\x„-x„-i\    if   n  >  1.  (4.1.30) 

The  inequality 

—  jc»|  <  r"|jci  —  jc0|    if   n>0,  (4.1.31) 
follows  by  induction  from  (4.1.30).  Now,  if  n  >  m, 

\X„  -Xm\  =  \(X„  -X„-l)+  (Xn-i  -X„-2)  H  h  (Xm  +  i  -Xm)\ 

<  \x„  -Xn-l\  +  \Xn-l  -X„-2\  H  h  \xm+\  -  X„\, 

and  (4.1.31)  yields 

\x„-xm\  <  \x1-x0\rm(l  +  r  +  ---  +  rn-m-1).  (4.1.32) 

In  Example  4.1.1 1  we  saw  that  the  sequence  {sk}  defined  by 

Sk  =  1  +  r  +  ■  ■  ■  +  rk 

converges  to  1/(1  —  r)  if  \r\  <  1;  moreover,  since  we  have  assumed  here  that  0  <  r  <  1, 
{sk}  is  nondecreasing,  and  therefore  Sk  <  1/(1  —  r)  for  all  k.  Therefore,  (4.1.32)  yields 

I  I        1-^1  -*0|    m  -f 

\x„  —  xm\  <  r      if    n  >  m. 

1  —  r 

Now  it  follows  that 

,  ,       |Xl-Xo|   ff     .,  AT 

\xn  —  xm\  <  r      if    n,m>  N, 

1  —  r 

and,  since  liiri/v-^oo  rN  =  0,  {xn}  converges,  by  Theorem  4.1.13.  If  x  =  limn^oo  xn,  then 
(4.1.29)  and  the  continuity  of  /  imply  that  x  =  f  (x). 
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4.1  Exercises 


1.  Prove:  If  sn  >  0  for  n  >  k  and  lim„^oo  sn  =  s,  then  s  >  0. 

2.  (a)    Show  that  lim„^oo  sn  =  s  (finite)  if  and  only  if  lim„^oo  \s„  —  s\  =  0. 

(b)    Suppose  that  \sn  —  s\  <  tn  for  large  n  and  lim„^oo  tn  =  0.   Show  that 
lim^—^-oo  Sn  =  s. 

3 .  Find  lim„^oo  sn .  Justify  your  answers  from  Definition  4.1.1. 

/  \         „        1  /i  \         u  +  n  .  .  1  .  nit 

(a)*=2+  —         (h)sn  =  —  (c),„  =  -S1n- 

4.  Find  lim„^.oo  sn.  Justify  your  answers  from  Definition 4.1.1. 

,  s  «  ...  n2  +  In  +  2 

(a)  s„  =   (b)  s„  =  —  

2«  +  V>T+7  n2  +  n 

i  \  sinw  m\  n  

(c)  j„  =  — =-  (a)  s„  =  v«z  +n-n 

yn 

5.  State  necessary  and  sufficient  conditions  on  a  convergent  sequence  {sn}  such  that 
the  integer  N  in  Definition  4.1.1  does  not  depend  upon  e. 

6.  Prove:  If  lim„^oo  5„  =  s  then  lim„^oo  \sn\  =  \s\. 

7.  Suppose  that  lim„^oo  sn  =  s  (finite)  and,  for  each  e  >  0,  \sn  —  tn  \  <  e  for  large  n. 
Show  that  lim„^oo  t„  =  s. 

8 .  Complete  the  proof  of  Theorem  4. 1 .6. 

9.  Use  Theorem  4. 1 .6  to  show  that  {sn }  converges. 

p  +  n  n 
r"  (2nV 

10.  Let  j  =  Tan-1*  be  the  solution  of  x  =  tany  such  that  —n/2  <  y  <  n/2.  Prove: 
If  xo  >  0  and  xn+\  =  Tan_1jc„  (n  >  0),  then  {xn}  converges. 

11.  Suppose  that  sq  and  A  are  positive  numbers.  Let 


1  /  A\ 

=  -  \s„  H  ,    «  >  0. 

2  V  s„J 


(a)  Show  that  sn+1  >  yf~A  if  «  >  0. 

(b)  Show  that  s„+i  <  sn  if  n  >  1. 

(c)  Show  that  s  =  lim„^oo  5„  exists. 

(d)  Find  s. 

12.  Prove:  If  {sn}  is  unbounded  and  monotonic,  then  either  lim„^oo  sn  =  ooorlim„_ 
— oo. 

1 3 .  Prove  Theorem  4 . 1 . 7 . 
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14.  Use  Theorem  4. 1 .7  to  find  linin-xx,  s„ . 

(X  -\-  n  1 
(a)  s„  =  — —    OS  >  0)  (b)  s„  =  cos  - 

p  +  n  n 

(c)  s„  =  n  sin  —  (d)  sn  =  log  n  —  n 

n 

(e)  sn  =  log(«  +  1)  -  log(n  -  1) 

15.  Suppose  that  lim„^oo  sn  =  s  (finite).  Show  that  if  c  is  a  constant,  then  lim„^oo  (cs„)  = 
cs. 

16.  Suppose  that  lim^-xx,  sn  =  s  where  s  =  ±oo.  Show  that  if  c  is  a  nonzero  constant, 
then  limn-Hx,  (csn)  =  cs. 

17.  Prove:  If  liirw  oo  Sn  —  s  and  lim^^oo  tn  —  t ,  where  s  and  t  are  finite,  then 

lim  (sn  +  tn)  =  s  +  t    and      lim  (sn  —  t„)  =  s  —  t . 

n—^oo  n—>oo 

18.  Prove:  If  limn^oo  sn  =  s  and  lim„^oo  tn  =  t,  where  s  and  t  are  in  the  extended 
reals,  then 

lim  (sn  +  t„)  =  s  +  t 

n— *-oo 

if  s  +  t  is  defined. 

19.  Suppose  that  lim„^oo  tn  =  t,  where  0  <  \t\  <  oo,  and  let  0  <  p  <  1.  Show  that 
there  is  an  integer  N  such  that  tn  >  pt  for  n  >  N  if  t  >  0,  or  tn  <  pt  for  n  >  N  if 
t  <  0.  In  either  case,  \tn\  >  p\t\  if  n  >  N. 

20.  Prove:  If  _ 

lim  —         =  0,    then      lim  5„  =  5. 

HINT:  Define  t„  =  (s„  —  s)/(s„  +  s)  and  solve  for  sn. 

21.  Prove:  if  liirin-xx,  sn  =  s  and  lim„^oo  tn  =  t,  where  s  and  t  are  in  the  extended 
reals,  then 

lim  s„tn  =  st 

n— >oo 

provided  that  st  is  defined  in  the  extended  reals. 

22.  Prove:  If  lim„^oo  s„  =  s  and  lim,,-^  t„  =  t,  then 

lim  S±  =  -  (A) 

«->oo  tn  t 

if  s/t  is  defined  in  the  extended  reals  and  (  /  0.  Give  an  example  where  s/t  is 
defined  in  the  extended  plane,  but  (A)  does  not  hold. 

23.  Prove  Theorem  4.1.9(b). 

24.  Find  ~s  and  s_. 

(a)  sn  =  [(-1)"  +  1]  n2  (b)  *„  =  (!-  r")  sin  ^ 
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(c)  s„  = 


,.2)l 


(r  -1) 


(d)  sn  =  n2  -  n 


1  +  r" 

(e)  sn  =  (-l)"tn  where  lim,,-^  t„  =  t 

25.  Find  s  and  s_. 

(a)  s„  =  (-1)"  (b)  sn  =  (-1)"  ^2  +  ^ 

,  .         n  +  {-\)"{2n  +  1)     ...  .  «tt 

(c)j„  =    (d)s„=sin  — 

n  3 

26.  Suppose  that  lim„^oo  |jB|  =  y  (finite).  Show  that  {sn}  diverges  unless  y  =  0  or  the 
terms  in  {sn}  have  the  same  sign  for  large  n.  Hint:  Use  Exercise  4.1.19. 

27.  Prove:  The  sequence  {sn}  is  bounded  if,  for  some  positive  e,  there  is  an  integer  N 
such  that  \sn  —  sm\  <  e  whenever  n,  m  >  N. 

In  Exercises  4.1.28—4.1.31,  assume  that's,  s_  {or  s),  1,  and  t_  are  in  the  extended  reals,  and 
show  that  the  given  inequalities  or  equations  hold  whenever  their  right  sides  are  defined 
{not  indeterminate). 

28.  (a)  lim  (— s„)  =  -s  (b)  lim  (— s„)  =  -s 

29.  (a)  Iliri  (s„  +  t„)  <  s  +  t  (b)  lim  (s„  + 1„)  >  s  + 1 

30.  (a)  If  sn  >0,tn>  0,  then  (i)  lim  s„t„  <  sT  and  (ii)  lhn  snt„  >  st_. 
(b)  If  sn  <0,tn>  0,  then  (i)  lim  snt„  <  st_  and  (ii)  lhn  s„t„  >  St. 

31.  (a)   If  lim  s„  =  s  >  0andf„  >  0,  then  (i)  lim  snt„  =  st  and  (ii)  lim  s„t„  =  st_. 
(b)    If  lim  s„  =  s  <  0and£„  >  0,  then  (i)  lim  sntn  =  st_and  (ii)  lim  sntn  =  sT. 

n— too  n— >oo  n^>oo 

32.  Suppose  that  {sn }  converges  and  has  only  finitely  many  distinct  terms.  Show  that  sn 
is  constant  for  large  n. 

33.  Let  so  and  s\  be  arbitrary,  and 

Sn  +  Sn-l  , 
Sn+1  =   z  '      "  -  l- 

Use  Cauchy's  convergence  criterion  to  show  that  {sn}  converges. 

OA        t  S!+S2-\  \-S„ 

34.  Let  t„  =  ,  n>\. 

n 

(a)  Prove:  If  linWoo  s„  =  s  then  lim„^oo  t„  =  s. 

(b)  Give  an  example  to  show  that  {tn}  may  converge  even  though  {sn}  does  not. 
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35.     (a)   Show  that 

^(1-f)(1-|)-(1-D=°'    if  a>0' 
HINT:  Look  at  the  logarithm  of  the  absolute  value  of  the  product. 
(b)    Conclude  from  (a)  that 


where  (  ^  1  is  the  generalized  binomial  coefficient  of  Example  2.5.3. 


4.2  EARLIER  TOPICS  REVISITED  WITH  SEQUENCES 

In  Chapter  2.3  we  used  e-8  definitions  and  arguments  to  develop  the  theory  of  limits, 
continuity,  and  differentiability;  for  example,  /  is  continuous  at  xq  if  for  each  e  >  0  there 
is  a  S  >  0  such  that  \f(x)  —  f{xo)\  <  e  when  \x  —  xq\  <  S.  The  same  theory  can  be 
developed  by  methods  based  on  sequences.  Although  we  will  not  carry  this  out  in  detail, 
we  will  develop  it  enough  to  give  some  examples.  First,  we  need  another  definition  about 
sequences. 

Definition  4.2.1  A  sequence  {t^}  is  a  subsequence  of  a  sequence  {sn}  if 

tk  =  s„k,  k>0, 

where  {n^}  is  an  increasing  infinite  sequence  of  integers  in  the  domain  of  {sn}.  We  denote 
the  subsequence  {t^}  by  {sn/c}.  ■ 

Note  that  {sn}  is  a  subsequence  of  itself,  as  can  be  seen  by  taking  =  k.  All  other 
subsequences  of  {s„ }  are  obtained  by  deleting  terms  from  {sn }  and  leaving  those  remaining 
in  their  original  relative  order. 


Example  4.2.1  If 


then  letting      =  2k  yields  the  subsequence 

{^Hii=B'4"'""2P--- 
and  letting      =  2k  +  1  yields  the  subsequence 
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Since  a  subsequence  {sn/c }  is  again  a  sequence  (with  respect  to  k),  we  may  ask  whether 
{s„k }  converges. 

Example  4.2.2  The  sequence  {sn}  defined  by 

Sn  =  (-1)"  (l  +  ^ 

does  not  converge,  but  {s„  \  has  subsequences  that  do.  For  example, 


{sue}  =  1 1  +  ttt!  and     lim  *2k  =  !• 

(        2k )  k^oo 

while 

{s2k+\}  =  1-1  ~  „■   ;  :  [  and     lim  s2/fc+i  =  -1. 

It  can  be  shown  (Exercise  4.2.1)  that  a  subsequence  {sn^}  of  {5„}  converges  to  1  if  and 
only  if  rik  is  even  for  k  sufficiently  large,  or  to  —1  if  and  only  if  is  odd  for  k  sufficiently 
large.  Otherwise,  {s„k}  diverges.  ■ 

The  sequence  in  this  example  has  subsequences  that  converge  to  different  limits.  The 
next  theorem  shows  that  if  a  sequence  converges  to  a  finite  limit  or  diverges  to  ±oo,  then 
all  its  subsequences  do  also. 

Theorem  4.2.2  // 

lim  s„  =  s    (— oo  <  s  <  oo),  (4.2.1) 


then 


for  every  subsequence  {sn/c}  of{s„). 


lim  s„k  =  s  (4.2.2) 

k— >co 


Proof  We  consider  the  case  where  s  is  finite  and  leave  the  rest  to  you  (Exercise  4.2.4). 
If  (4.2.1)  holds  and  e  >  0,  there  is  an  integer  N  such  that 

|jB  -  j|  <  e    if   n>  N. 

Since  is  an  increasing  sequence,  there  is  an  integer  K  such  that  >  N  if  k  >  K. 
Therefore, 

\s„k  -L\<€    if   k  >  K, 
which  implies  (4.2.2).  TJ 

Theorem  4.2.3  If{s„}  is  monotonic and  has  a  subsequence  {s„k}  such  that 

lim  s„k  =  s    (— oo  <  s  <  oo), 

k— >oo 

then 

lim  s„  =  s. 
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Proof  We  consider  the  case  where  {sn }  is  nondecreasing  and  leave  the  rest  to  you  (Ex- 
ercise 4.2.6).  Since  {s„k}  is  also  nondecreasing  in  this  case,  it  suffices  to  show  that 

sup{s„k}  =  sup{s„}  (4.2.3) 

and  then  apply  Theorem  4.1.6(a).  Since  the  set  of  terms  of  {s„k}  is  contained  in  the  set  of 
terms  of  {s„}, 

snp{s„}  >  sxnp{s„k}.  (4.2.4) 

Since  {sn}  is  nondecreasing,  there  is  for  every  n  an  integer  such  that  sn  <  s„k.  This 
implies  that 

sup{s„}  <  sup{sn(t}. 

This  and  (4.2.4)  imply  (4.2.3).  TJ 
Limit  Points  in  Terms  of  Sequences 

In  Section  1.3  we  defined  limit  point  in  terms  of  neighborhoods:  X  is  a  limit  point  of  a  set 
S  if  every  neighborhood  of  X  contains  points  of  S  distinct  from  X.  The  next  theorem  shows 
that  an  equivalent  definition  can  be  stated  in  terms  of  sequences. 

Theorem  4.2.4  A  point  x  is  a  limit  point  of  a  set  S  if  and  only  if  there  is  a  sequence 
{xn}  of  points  in  S  such  that  x„  ^  xforn  >  1,  and 

lim  x„  =  ~x. 

n— >oo 


Proof  For  sufficiency,  suppose  that  the  stated  condition  holds.  Then,  for  each  e  >  0, 
there  is  an  integer  N  such  that  0  <  \x„  —x  |  <  e  if  n  >  N.  Therefore,  every  e-neighborhood 
of  x  contains  infinitely  many  points  of  S.  This  means  that  x  is  a  limit  point  of  S . 

For  necessity,  let  X  be  a  limit  point  of  S.  Then,  for  every  integer  n  >  1,  the  interval 
(x  —  l/n,X  +  l/n)  contains  a  point  xn  x)  in  S.  Since  \xm  —  x\  <  \/n  if  m  >  n, 
lim„^oo  x„  =  x.  TJ 

We  will  use  the  next  theorem  to  show  that  continuity  can  be  defined  in  terms  of  se- 
quences. 

Theorem  4.2.5 

(a)  If{xn)  is  bounded,  then  {xn}  has  a  convergent  subsequence. 

(b)  If{xn}  is  unbounded  above,  then  {xn}  has  a  subsequence  {x„k}  such  that 

lim  x„k  =  oo. 

(c)  !f{xn}  is  unbounded  below,  then  {xn}  has  a  subsequence  {x„k}  such  that 

lim  x„k  =  —oo. 
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Proof  We  prove  (a)  and  leave  (b)  and  (c)  to  you  (Exercise  4.2.7).  Let  S  be  the 
set  of  distinct  numbers  that  occur  as  terms  of  {xn}.  (For  example,  if  {xn}  =  {(— 1)"}, 
S  =  {1,-1};  if  {xn}  =  {l,i,l,i,...,l,l/«,...},  S  =  {l,i,...,l/«,...}.)  If  S 
contains  only  finitely  many  points,  then  some  ~x  in  S  occurs  infinitely  often  in  {xn  };  that  is, 
{xn}  has  a  subsequence  {x„k }  such  that  x„k  =  x  for  all  k.  Then  lim^oo  xnk  =  x,  and  we 
are  finished  in  this  case. 

If  S  is  infinite,  then,  since  S  is  bounded  (by  assumption),  the  Bolzano-Weierstrass  the- 
orem (Theorem  1.3.8)  implies  that  S  has  a  limit  point  X.  From  Theorem  4.2.4,  there  is  a 
sequence  of  points  {y  j}  in  S,  distinct  from  X,  such  that 

lim  yj  =  x.  (4.2.5) 

j-MX} 

Although  each  yj  occurs  as  a  term  of  {xn},  {yj}  is  not  necessarily  a  subsequence  of  {x„\, 
because  if  we  write 

yj  =  xn  j  , 

there  is  no  reason  to  expect  that  {«/}  is  an  increasing  sequence  as  required  in  Defini- 
tion 4.2.1.  However,  it  is  always  possible  to  pick  a  subsequence  {«/jt}  of  {«/}  that  is 
increasing,  and  then  the  sequence  {y  jk}  =  {snj  }  is  a  subsequence  of  both  {yj}  and  {xn}. 
Because  of  (4.2.5)  and  Theorem  4.2.2  this  subsequence  converges  to  X.  H 

Continuity  in  Terms  of  Sequences 

We  now  show  that  continuity  can  be  defined  and  studied  in  terms  of  sequences. 

Theorem  4.2.6  Let  f  be  defined  on  a  closed  interval  [a,  b]  containing  X.  Then  f  is 
continuous  atx  (from  the  right  ifx  =  a,  from  the  left  if~x  =  b)  if  and  only  if 

lim  f(xn)  =  f(x)  (4.2.6) 

n— >oo 

whenever  {x„}  is  a  sequence  of  points  in  [a,  b]  such  that 

lim  xn  =  ~x.  (4.2.7) 

n— >oo 

Proof  Assume  that  a  <  ~x  <  b;  only  minor  changes  in  the  proof  are  needed  if  x  =  a  or 
~x  =  b.  First,  suppose  that  /  is  continuous  at  X  and  {xn}  is  a  sequence  of  points  in  [a,  b] 
satisfying  (4.2.7).  If  e  >  0,  there  is  a  S  >  0  such  that 

\f(x)-f(x)\  <€    if    |jc  — jcl  <  (4.2.8) 

From  (4.2.7),  there  is  an  integer  N  such  that  \x„  —  x|  <  S  if  n  >  N.  This  and  (4.2.8) 
imply  that  \f(xn)  —  f(x)\  <  e  if  n  >  N.  This  implies  (4.2.6),  which  shows  that  the  stated 
condition  is  necessary. 

For  sufficiency,  suppose  that  /  is  discontinuous  at  x.  Then  there  is  an  eo  >  0  such  that, 
for  each  positive  integer  n,  there  is  a  point  xn  that  satisfies  the  inequality 

\x„-x\  <  - 
n 
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while 

\f(xn)-m\  >€0. 

The  sequence  {xn}  therefore  satisfies  (4.2.7),  but  not  (4.2.6).  Hence,  the  stated  condition 
cannot  hold  if  /  is  discontinuous  at  X.  This  proves  sufficiency.  TJ 

Armed  with  the  theorems  we  have  proved  so  far  in  this  section,  we  could  develop  the 
theory  of  continuous  functions  by  means  of  definitions  and  proofs  based  on  sequences  and 
subsequences.  We  give  one  example,  a  new  proof  of  Theorem  2.2.8,  and  leave  others  for 
exercises. 

Theorem  4.2.7  If  f  is  continuous  on  a  closed  interval  [a,  b],  then  f  is  bounded  on 
[a,b]. 

Proof  The  proof  is  by  contradiction.  If  /  is  not  bounded  on  [a,b],  there  is  for  each 
positive  integer  n  a  point  xn  in  [a,  b]  such  that  |/(^«)|  >  n.  This  implies  that 

lim  |/(x„)|  =  oo.  (4.2.9) 

n— >oo 

Since  {xn}  is  bounded,  {x„\  has  a  convergent  subsequence  {x„k}  (Theorem  4.2.5(a)).  If 

x  =  lim  xnk , 

k— *oo 

then  x  is  a  limit  point  of  [a,  b],  so  x  €  [a,  b].  If  /  is  continuous  on  [a,b],  then 

lim  f(x„k)  =  f(x) 

k — >OQ 


by  Theorem  4.2.6,  so 


lim  \f{xnk)\  =  | f(x) I 

k — >co 


(Exercise  4.1.6),  which  contradicts  (4.2.9).  Therefore,  /  cannot  be  both  continuous  and 
unbounded  on  [a,  b]  TJ 

4.2  Exercises 


1.  Let  s„  =  (— 1)"(1  +  l/n).  Show  that  lim^oo  $nk  =  1  if  and  only  if  n^  is  even  for 
large  k,  lim^oo  s„k  =  —  1  if  and  only  if  n^  is  odd  for  large  k,  and  {s„k}  diverges 
otherwise. 

2.  Find  all  numbers  L  in  the  extended  reals  that  are  limits  of  some  subsequence  of  {s„} 
and,  for  each  such  L,  choose  a  subsequence  {s„k }  such  that  lim^^oo  s„k  =  L. 

(a)  sn  =  (-l)nn  (b)  sn  =  (l  +  i  j  cos  ^ 

/  \          (       \  \      nn  .  .  1 

(c)  s„  =    1   sin  —  (d)  s„  =  - 

\      n2- )       1  n 

(e)  sn  =  [(-1)"  +  1]  «2  (f )  sn  =  —  (sin  —  +  cos  — ) 


200    Chapter  4  Infinite  Sequences  and  Series 


3.  Construct  a  sequence  {s„}  with  the  following  property,  or  show  that  none  exists:  for 
each  positive  integer  m,  {sn}  has  a  subsequence  converging  to  m. 

4.  Complete  the  proof  of  Theorem  4.2.2. 

5.  Prove:  If  lim„^oo  s„  =  s  and  {s„}  has  a  subsequence  {s„k }  such  that  (—  I)  s„k  >  0, 
then  s  =  0. 

6.  Complete  the  proof  of  Theorem  4.2.3. 

7.  Prove  Theorem  4.2.5(b)  and  (c) . 

8.  Suppose  that  {sn}  is  bounded  and  all  convergent  subsequences  of  {sn}  converge  to 
the  same  limit.  Show  that  {sn}  is  convergent.  Give  an  example  showing  that  the 
conclusion  need  not  hold  if  {sn}  is  unbounded. 

9.  (a)   Let  /  be  defined  on  a  deleted  neighborhood  N  of  X.  Show  that 


if  and  only  if  limn^oo  /  (x„)  =  L  whenever  {x„}  is  a  sequence  of  points  in  N 
such  that  lim„^oo  xn  =  X.  Hint:  See  the  proof  of  Theorem  4.2.6. 
(b)    State  a  result  like  (a)  for  one-sided  limits. 

10.  Give  a  proof  based  on  sequences  for  Theorem  2.2.9.  Hint:  Use  Theorems  4.1.6, 
4.2.2,  4.2.5,  and  4.2.6. 

11.  Give  a  proof  based  on  sequences  for  Theorem  2.2.12. 

12.  Suppose  that  /  is  defined  on  a  deleted  neighborhood  N  of  X  and  {f(xn)}  ap- 
proaches a  limit  whenever  {xn}  is  a  sequence  of  points  in  N  and  lim„^oo  xn  = 
~x.  Show  that  if  {xn}  and  {yn}  are  two  such  sequences,  then  lim„^oo  f(x„)  = 
lim„^.oo  / (yn).  Infer  from  this  and  Exercise  4.2.9  that  limx^  f  (x)  exists. 

13.  Prove:  If  /  is  defined  on  a  neighborhood  N  of  X,  then  /  is  differentiable  at  X  if  and 
only  if 


n^oo        Xn  —  X 

exists  whenever  {xn }  is  a  sequence  of  points  in  N  such  that  xn  ^  X  and  liirin^oo  xn  = 
x.  HINT:  Use  Exercise  4.2.12. 


The  theory  of  sequences  developed  in  the  last  two  sections  can  be  combined  with  the  fa- 
miliar notion  of  a  finite  sum  to  produce  the  theory  of  infinite  series.  We  begin  the  study  of 
infinite  series  in  this  section. 

Definition  4.3.1  If  is  an  infinite  sequence  of  real  numbers,  the  symbol 


lim  /  (x)  =  L 


4.3  INFINITE  SERIES  OF  CONSTANTS 


OO 


n=k 
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is  an  infinite  series,  and  an  is  the  nth  term  of  the  series.  We  say  that  Y^n=k  an  converges  to 
the  sum  A,  and  write 

oo 

^an  =  A, 

n=k 

if  the  sequence  {^4«}£°  defined  by 

An  =  ak  +  ak+1 -\  \-an,    n  >  k, 

converges  to  A.  The  finite  sum  An  is  the  nth  partial  sum  of  Yln=k  a"  -  ^  diverges, 
we  say  that  Y^=k  a"  diverges;  in  particular,  if  lim„^oo  An  =  oo  or  — oo,  we  say  that 
Yln^=k  an  diverges  to  oo  or  —oo,  and  write 

oo  oo 

an  =  oo    or         an  =  —oo. 

n=k  n=k 

A  divergent  infinite  series  that  does  not  diverge  to  ±00  is  said  to  oscillate,  or  be  oscillatory. 


We  will  usually  refer  to  infinite  series  more  briefly  as  series. 


Example  4.3.1  Consider  the  series 

00 

J2r"-  -l<r<l. 

n=0 

Here  an  =  r"  (n  >  0)  and 

An  =  1  +  r  +  r2  +  ■  ■  ■  +  r"  =  1  ~  *  +  ,  (4.3.1) 

i—r 

which  converges  to  1/(1  —  r)  as  n  — ►  00  (Example  4.1.11);  thus,  we  write 

00 

Vr"  =   ,     -1  <  r  <  1. 

n=o 

If  |r  I  >  1,  then  (4.3.1)  is  still  valid,  but  J2T=o  r"  diverges;  if  r  >  1,  then 

00 

^  r"  =  00,  (4.3.2) 
«=o 

while  if  r  <  —I,  X^^Lo  r"  oscillates,  since  its  partial  sums  alternate  in  sign  and  their 
magnitudes  become  arbitrarily  large  for  large  n.  If  r  =  —1,  then  ^2m+i  =  0  and  A-im  =  1 
for  m  >  0,  while  if  r  =  1,  An  =  n  +  1;  in  both  cases  the  series  diverges,  and  (4.3.2)  holds 
if  r  =  1.  ■ 
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The  series  ^^L0  r"  is  called  the  geometric  series  with  ratio  r.  It  occurs  in  many  appli- 
cations. 

An  infinite  series  can  be  viewed  as  a  generalization  of  a  finite  sum 

N 

A  =  ^2ian  =  ak  +  ak+1  H  \- aN 

n=k 

by  thinking  of  the  finite  sequence  {ak,  ak+i,  •  •  • ,  fljv}  as  being  extended  to  an  infinite  se- 
quence with  an  =  0  for  n  >  N .  Then  the  partial  sums  of  Y^=k  a"  are 

An  =  ak  +  ak+i  -\  h  an,    k  <n  <  N, 

and 

An  =  A,    n>  N; 

that  is,  the  terms  of  {An}™  equal  the  finite  sum  A  for  n  >  k.  Therefore,  lim„^oo  An 
=  A. 

The  next  two  theorems  can  be  proved  by  applying  Theorems  4. 1 .2  and  4. 1 .8  to  the  partial 
sums  of  the  series  in  question  (Exercises  4.3.1  and  4.3.2). 

Theorem  4.3.2  The  sum  of  a  convergent  series  is  unique. 

Theorem  4.3.3  Let 

oo  oo 

an  =  A    and         bn  =  B, 

n=k  n=k 
where  A  and  B  are  finite.  Then 

oo 

}{can)  =  cA 

n=k 

if  c  is  a  constant, 

oo 

J2(an  +  b„)  =  A  +  B. 

n=k 

and 

oo 

YJifln-bn)  =  A-B. 

n=k 

These  relations  also  hold  if  one  or  both  of  A  and  B  is  infinite,  provided  that  the  right  sides 
are  not  indeterminate. 

Dropping  finitely  many  terms  from  a  series  does  not  alter  convergence  or  divergence, 
although  it  does  change  the  sum  of  a  convergent  series  if  the  terms  dropped  have  a  nonzero 
sum.  For  example,  suppose  that  we  drop  the  first  k  terms  of  a  series  ^^L0  an ,  and  consider 
the  new  series  Yln^k  a"  ■  Denote  the  partial  sums  of  the  two  series  by 

A„  =  a0  +  a\  -I  \-a„,  n>0, 

and 

A'n  =  ak  +  ak+1 -\  Yan,  n>k. 
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Since 

An  =  (do  +  (ii  H  1-  flfe_i)  +  4,,    n  >  k, 

it  follows  that  =  linin^oo  An  exists  (in  the  extended  reals)  if  and  only  if  A'  =  lim„^.oo  A'n 
does,  and  in  this  case 

A  =  (a0  +  fli  H  h  flfc-i)  + 

An  important  principle  follows  from  this. 

Lemma  4.3.4  Suppose  that  for  n  sufficiently  large  (that  is,  for  n  >  jome  integer  N) 
the  terms  o/5Zn=fc  a„  satisfy  some  condition  that  implies  convergence  of  an  infinite  series. 
Then  $^n=£  converges.  Similarly,  suppose  that  for  n  sufficiently  large  the  terms  ^^L/t  <3« 
satisfy  some  condition  that  implies  divergence  of  an  infinite  series.  Then  5^n=£  a«  diverges. 

Example  4.3.2  Consider  the  alternating  series  test,  which  we  will  establish  later  as  a 
special  case  of  a  more  general  test: 

The  series        an  converges  if(—\)nan  >  0,  |an+i|  <  \a„\,  and  limn-+0O  an  =  0. 

The  terms  of 

y,  16  +(-2)" 

n2n 

n=l 

do  not  satisfy  these  conditions  for  all  n  >  1,  but  they  do  satisfy  them  for  sufficiently  large 
n.  Hence,  the  series  converges,  by  Lemma  4.3.4.  ■ 

We  will  soon  give  several  conditions  concerning  convergence  of  a  series  Yl™=k  a"  witn 
nonnegative  terms.  According  to  Lemma  4.3.4,  these  results  apply  to  series  that  have  at 
most  finitely  many  negative  terms,  as  long  as  an  is  nonnegative  and  satisfies  the  conditions 
for  n  sufficiently  large. 

When  we  are  interested  only  in  whether  YHv=k  an  converges  or  diverges  and  not  in  its 
sum,  we  will  simply  say  "^a«  converges"  or  "^a«  diverges."  Lemma  4.3.4  justifies 
this  convention,  subject  to  the  understanding  that  ^  an  stands  for  J^^L^  an,  where  k  is  an 
integer  such  that  a„  is  defined  for  n  >  k.  (For  example, 

1  °°  1 

  stands  for     >  , 

(n  -  6)2  ^  (n  -  6)2 

n=k 

where  k  >  7.)  We  write  ^  an  =  oo  (— oo)  if  ^  an  diverges  to  oo  (— oo).  Finally,  let  us 
agree  that 

oo  oo 

^2,an    and       ^  an+j 

n=k  n—k—j 

(where  we  obtain  the  second  expression  by  shifting  the  index  in  the  first)  both  represent  the 
same  series. 
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Cauchy's  Convergence  Criterion  for  Series 

The  Cauchy  convergence  criterion  for  sequences  (Theorem  4.1.13)  yields  a  useful  criterion 
for  convergence  of  series. 

Theorem  4.3.5  (Cauchy's  Convergence  Criterion  for  Series)  A  series 
On  converges  if  and  only  if  for  every  e  >  0  there  is  an  integer  N  such  that 

\a„  +  an+\  H  h  am\  <  e    if   m>n>N.  (4.3.3) 

Proof   In  terms  of  the  partial  sums  {An}  of  £  a„, 

a„  +  a„+\  +  V  am  =  Am  —  An-\. 

Therefore,  (4.3.3)  can  be  written  as 

\Am  —  A„-i  |  <  e    if   m  >  n  >  N. 

Since  ^a„  converges  if  and  only  if  {An}  converges,  Theorem  4.1.13  implies  the  conclu- 
sion. TJ 

Intuitively,  Theorem  4.3.5  means  that  ^  an  converges  if  and  only  if  arbitrarily  long  sums 

a„  +  an+i  H  h  am,  m>n, 

can  be  made  as  small  as  we  please  by  picking  n  large  enough. 

Example  4.3.3  Considerthe  geometric  series  J2  '""  of  Example  4.3.1.  If  \r  \  >  1,  then 
{rn}  does  not  converge  to  zero.  Therefore  J]  rn  diverges,  as  we  saw  in  Example  4.3.1.  If 
\r  I  <  1  and  m  >  n,  then 


A„  \  =  \rn+l  +rn+2  +  ...  +  rm\ 

<  |r|"+1(l  +  \r\  +  ■■■+  \r\m-"-1) 

I  _  \y\m-n        i^.in  +  1 


(4.3.4) 


|r|»+i  ___  < 


l-|r| 


If  e  >  0,  choose  N  so  that 


Then  (4.3.4)  implies  that 


\r\N+1 


<  €. 


\Am-A„\<e    if  m>n>N. 
Now  Theorem  4.3.5  implies  that  ^  r"  converges  if  \r\  <  1,  as  in  Example  4.3.1. 

Letting  m  =  n  in  (4.3.3)  yields  the  following  important  corollary  of  Theorem  4.3.5. 

Corollary  4.3.6  lf^an  converges,  then  lim„^oo  an  =  0. 
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It  must  be  emphasized  that  Corollary  4.3.6  gives  a  necessary  condition  for  convergence; 
that  is,  an  cannot  converge  unless  limn^oo  an  =  0.  The  condition  is  not  sufficient,  ^  an 
may  diverge  even  if  limn^oo  an  =  0.  We  will  see  examples  below. 

We  leave  the  proof  of  the  following  corollary  of  Theorem  4.3.5  to  you  (Exercise  4.3.5). 

Corollary  4.3.7  If*Y_,an  converges,  then  for  each  e  >  0  there  is  an  integer  K  such 
that 


n=k 


<€    if  k  >  K; 


that  is, 


oo 

lim       an  =  0. 


k— >oo 


n=k 


Example  4.3.4  If  \r\  <  l,then 


Therefore,  if 


then 


n=k 


n  =k 


rkJ2r" 

n=0 


1  -r 


\K 


1  -r 


<  €, 


n=k 


<e    if    k  >  K, 


which  implies  that  limyt^oo  Yl^k  r"  =  ®- 


Series  of  Nonnegative  Terms 

The  theory  of  series  J]  an  with  terms  that  are  nonnegative  for  sufficiently  large  n  is  simpler 
than  the  general  theory,  since  such  a  series  either  converges  to  a  finite  limit  or  diverges  to 
oo,  as  the  next  theorem  shows. 

Theorem  4.3.8  If  an  >  0  for  n  >  k,  then  converges  if  its  partial  sums  are 

bounded,  or  diverges  to  oo  they  are  not.  These  are  the  only  possibilities  and,  in  either 
case. 


where 


2^  an  =  sup  {An  \  n  >  k}, 

n=k 

An=ak+ak+i-\  \-a„,  n>k. 
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Proof  Since  An  =  An-\  +a„  and  an  >  0  (n  >  k),  the  sequence  {An}  is  nondecreasing, 
so  the  conclusion  follows  from  Theorem  4.1.6(a)  and  Definition  4.3.1. 

U 

If  dn  >  0  for  sufficiently  large  n,  we  will  write  ^  an  <  oo  if  ^  an  converges.  This  con- 
vention is  based  on  Theorem  4.3.8,  which  says  that  such  a  series  diverges  only  if  ^  an  = 
oo.  The  convention  does  not  apply  to  series  with  infinitely  many  negative  terms,  because 
such  series  may  diverge  without  diverging  to  oo;  for  example,  the  series  ^^L0(—  os~ 
cillates,  since  its  partial  sums  are  alternately  1  and  0. 

Theorem  4.3.9  (The  Comparison  Test)  Suppose  that 

0<an<bn,    n>k.  (4.3.5) 

Then 

(a)  J2an  <  OO  ifY.bn  <  OO- 

(b)  J2bn  =  OO  ifJ2an  =  OO. 

Proof    (a)  If 

A„  =  ak  +  ak+1  H  \- an    and    Bn  =  bk  +  bk+1  H  !-/?„,    «  >  k, 

then,  from  (4.3.5), 

<  Bn.  (4.3.6) 

Now  we  use  Theorem  4.3.8.  If  ^  bn  <  oo,  then  is  bounded  above  and  (4.3.6)  implies 
that  {An}  is  also;  therefore,  ^an  <  oo.  On  the  other  hand,  if  =  oo,  then  {An}  is 

unbounded  above  and  (4.3.6)  implies  that  {Bn}  is  also;  therefore,     bn  =  oo. 

We  leave  it  to  you  to  show  that  (a)  implies  (b).  TJ 
Example  4.3.5  Since 

—  <rn,  n>\, 
n 

and  ^  r"  <  oo  if  0  <  r  <  1,  the  series  ^  r"/n  converges  if  0  <  r  <  1,  by  the  comparison 
test.  Comparing  these  two  series  is  inconclusive  if  r  >  1,  since  it  does  not  help  to  know 
that  the  terms  of  ^  rn  jn  are  smaller  than  those  of  the  divergent  series  ^r".  If  r  <  0,  the 
comparison  test  does  not  apply,  since  the  series  then  have  infinitely  many  negative  terms. 


Example  4.3.6  Since 

r"  <  nr" 

and^r"  =  oo  if  r  >  1,  the  comparison  test  implies  that  ^«r"  =  ooifr  >  1.  Compar- 
ing these  two  series  is  inconclusive  if  0  <  r  <  1,  since  it  does  not  help  to  know  that  the 
terms  of  ^  nr"  are  larger  than  those  of  the  convergent  series  ^  r".  ■ 
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The  comparison  test  is  useful  if  we  have  a  collection  of  series  with  nonnegative  terms 
and  known  convergence  properties.  We  will  now  use  the  comparison  test  to  build  such  a 
collection. 

Theorem  4.3.10  (The  Integral  Test)  Let 

c„  =  f(n),    n>k,  (4.3.7) 
where  f  is  positive,  nonincreasing,  and  locally  integrable  on  [k,  oo).  Then 

J2cn<oo  (4.3.8) 

if  and  only  if 

f(x)  dx  <  oo.  (4.3.9) 


I 

Jk 


Ik 

Proof   We  first  observe  that  (4.3.9)  holds  if  and  only  if 

/       fix)  dx  <  oo  (4.3.10) 

.  Jn 
n=k 

(Exercise  4.3.9),  so  it  is  enough  to  show  that  (4.3.8)  holds  if  and  only  if  (4.3.10)  does.  From 

(4.3.7)  and  the  assumption  that  /  is  nonincreasing, 

cn+l  =  f(n  +  1)  <  f(x)  <  f(n)  =  c„,    n  <  x  <  n  +  1,    n  >  k. 

Therefore, 

t>n  +  \  i>n  +  \  i>n  +  \ 

Cn+i  =  I       cn+1dx<  I       f(x)dx<  I       c„dx  =  c„,  n>k 

Jn  Jn  Jn 

(Theorem  3.3.4).  From  the  first  inequality  and  Theorem  4.3.9(a)  with  an  =  cn+\  and 
b„  =  f„+1  f{x)dx,  (4.3.10)  implies  that  ^cn+\  <  oo,  which  is  equivalent  to  (4.3.8). 
From  the  second  inequality  and  Theorem  4.3.9(a)  with  a„  =         / (x)  dx  and  b„  =  c„, 

(4.3.8)  implies  (4.3.10).  '  U 

Example  4.3.7  The  integral  test  implies  that  the  series 

Y—>    T  l-  >    and    T  To 

nP      ^  n(logn)P  ^  n  \ogn  [log(log«)]p 

converge  if  p  >  1  and  diverge  if  0  <  p  <  1,  because  the  same  is  true  of  the  integrals 

r°°  dx     r°°    dx       ^    r°°  dx 

J  a      XP'      Ja      x(\0gx)P'  Ja      X  log  X  [log(log  x)]^7 

if  a  is  sufficiently  large.  (See  Example  3.4.3  and  Exercise  3.4.10.)  The  three  series  di- 
verge if  p  <  0:  the  first  by  Corollary  4.3.6,  the  second  by  comparison  with  the  divergent 
series  ^l/n,  and  the  third  by  comparison  with  the  divergent  series  ^  l/(«  log  n).  (The 
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divergence  of  the  last  two  series  for  p  <  0  also  follows  from  the  integral  test,  but  the 
divergence  of  the  first  does  not.  Why  not?)  These  results  can  be  generalized:  If 

Lo(x)  =  x    and    L^ix)  =  logfL^-i (x)],    k  >  1, 

then 

V  \  

^  L0{n)L1{n)---Lk(n)[Lk+1(n)]P 
converges  if  and  only  if  p  >  1  (Exercise  4.3.1 1).  ■ 

This  example  provides  an  infinite  family  of  series  with  known  convergence  properties 
that  can  be  used  as  standards  for  the  comparison  test. 

Except  for  the  series  of  Example  4.3.7,  the  integral  test  is  of  limited  practical  value, 
since  convergence  or  divergence  of  most  of  the  series  to  which  it  can  be  applied  can  be 
determined  by  simpler  tests  that  do  not  require  integration.  However,  the  method  used  to 
prove  the  integral  test  is  often  useful  for  estimating  the  rate  of  convergence  or  divergence 
of  a  series.  This  idea  is  developed  in  Exercises  4.3.13  and  4.3.14. 


Example  4.3.8  The  series 


°°  i 

^WTn^  (4311) 


converges  if  q  >  1/2,  by  comparison  with  the  convergent  series  £  \/n2q ,  since 


1  1 

<  n  >  1. 


(n2  +  n)t  n2i 
This  comparison  is  inconclusive  if  q  <  1/2,  since  then 


T  — 

L^,  „2q 


00, 

and  it  does  not  help  to  know  that  the  terms  of  (4.3.1 1)  are  smaller  than  those  of  a  divergent 
series.  However,  we  can  use  the  comparison  test  here,  after  a  little  trickery.  We  observe 
that 

 =  ^  -^r-  =  oo-    Q  <  1/2, 

(n  +  l)2?      ^  n2i         >    i  -  i  > 

n=k—\  n=k 

and 

1  1 

< 


(«  +  1)2<?      (n2  +  n)i ' 
Therefore,  the  comparison  test  implies  that 

E-J   =  oo.  q  <  1/2. 
(n2  +  n)i  H  ~  ' 
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The  next  theorem  is  often  applicable  where  the  integral  test  is  not.  It  does  not  require  the 
kind  of  trickery  that  we  used  in  Example  4.3.8. 

Theorem  4.3.11  Suppose  that an  >  0andbn  >  Oforn  >  k.  Then 

(a)  >   an  <  oo    if    /   bn  <  oo  lim  a„/b„  <  oo. 

(b)  ^^««  =  oo    z/  =  oo    one/     lim  an/bn  >  0. 

Proof  (a)  If  lim„^oo  an/bn  <  oo,  then  {an/bn}  is  bounded,  so  there  is  a  constant  M 
and  an  integer  k  such  that 

a„  <  Mbn,    n  >  k. 

Since  ^bn  <  oo,  Theorem  4.3.3  implies  that  ^(Mo„)  <  oo.  Now  ^a„  <  oo,  by  the 
comparison  test. 

(b)  If  lim„^^  an/bn  >  0,  there  is  a  constant  m  and  an  integer  k  such  that 

a„  >  wo„,    n  >  k. 

Since  ^bn  =  oo,  Theorem  4.3.3  implies  that  ^(mbn)  =  oo.  Now  =  oo,  by  the 

comparison  test.  TJ 

Example  4.3.9  Let 

1  2  +  sin«:r/6 

Then 

a„  2  +  sin«:r/6 


0„        (1  +  (1-1/71)9 


lim  — -  =  3    and     lim  — —  =  1. 

n^°°  On  n^oo  0« 


Since  ^  o„  <  oo  if  and  only  if  p  +  q  >  1,  the  same  is  true  of  ^  a„,  by  Theorem  4.3.1 1. 


The  following  corollary  of  Theorem  4.3. 1 1  is  often  useful,  although  it  does  not  apply  to 
the  series  of  Example  4.3.9. 

Corollary  4.3.12  Suppose  that an  >  0andbn  >  Oforn  >  k,  and 

lim  —  =  L, 

where  0  <  L  <  oo.  Then  ^  an  and     bn  converge  or  diverge  together. 
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Example  4.3.10  With  this  corollary  we  can  avoid  the  kind  of  trickery  used  in  the 
second  part  of  Example  4.3.8,  since 

1/1  1 
lim  — -   /  -— —  =  lim   =  1, 

«^oo  (n2  +  n)«  /  n2ct     «^oo  (l  +  IjrCfi 


so 


1  and    V — 

^  (n2  +  n)i    an      ^  n2i 


converge  or  diverge  together. 


The  Ratio  Test 

It  is  sometimes  possible  to  determine  whether  a  series  with  positive  terms  converges  by 
comparing  the  ratios  of  successive  terms  with  the  corresponding  ratios  of  a  series  known  to 
converge  or  diverge. 

Theorem  4.3.13  Suppose  thatan  >  0.  bn  >  0,  and 

-^±1  <  (4.3.12) 

a„  b„ 

Then 

(a)  J2an  <  OO  ifJ2bn  <  OO. 

(b)  J2bn  =  00  ifHan  =  OO. 

Proof   Rewriting  (4.3.12)  as 

Q-n  +  l  dn 

b„+\  ~  bn ' 

we  see  that  {a„/bn}  is  nonincreasing.  Therefore,  lim„^oo  a„/b„  <  oo,  and  Theorem  4.3.1 1(a) 
implies  (a). 

To  prove  (b),  suppose  that  ^an  =  oo.  Since  {a„/b„}  is  nonincreasing,  there  is  a 
number  p  such  that  bn  >  pan  for  large  n.  Since  ^2(pa„)  =  oo  if  ^an  =  oo,  Theo- 
rem 4.3.9(b)  (with  an  replaced  by  pan)  implies  that  ^  bn  =  oo.  H 

We  will  use  this  theorem  to  obtain  two  other  widely  applicable  tests:  the  ratio  test  and 
Raabe's  test. 

Theorem  4.3.14  (The  Ratio  Test)  Suppose  that  a„  >  Oforn  >  k.  Then 

(a)  J2an  <  oo  if]\mn-+ooan+i/an  <  1. 

(b)  Yan  =  oo  if  \im„^an+1 /an  >  1. 
// 

lim  <  1  <  lim  -^±£,  (4.3.13) 

n^oo    a„  n~*oo  a„ 

then  the  test  is  inconclusive;  that  is,  ^  an  may  converge  or  diverge. 
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Proof   (a)  If 

■p       an  +  l  , 

hm    <  1, 

there  is  a  number  r  such  that  0  <  r  <  1  and 


<  r 

for  n  sufficiently  large.  This  can  be  rewritten  as 


Since  ^  r"  <  oo,  Theorem  4.3.13(a)  with  bn  =  r"  implies  that  £  a„  <  oo. 
(b)lf 


hm    >  1, 


there  is  a  number  r  such  that  r  >  1  and 

an+i 


>  r 
a„ 

for  n  sufficiently  large.  This  can  be  rewritten  as 

  >  


a„ 


Since  £^r"  =  oo,  Theorem 4.3.13(b)  witha„  =  r"  implies  that  *^bn  =  oo.  TJ 
To  see  that  no  conclusion  can  be  drawn  if  (4.3.13)  holds,  consider 


This  series  converges  if  p  >  1  or  diverges  if  p  <  1 ;  however, 


hm   =  hm    =  1 


for  every  p . 

Example  4.3.11  If 

then 


Ea«  =  E(2  +  sin!y) r"- 

.    (n  +  1)tt 

2  +  sin  

fln+i  2 
 =  r  


H7T 

2  +  sin  — 
2 

which  assumes  the  values  3r/2,  2r/3,  r/2,  and  2r,  each  infinitely  many  times;  hence, 

  an  +  l  an  +  l  r 

lim   =  2r       and        lim   =  -. 

«->oo   a„  „^oo   a«  2 

Therefore,  *^an  converges  ifO  <  r  <  1/2  and  diverges  if  r  >  2.  The  ratio  test  is 
inconclusive  ifl/2<r<2.  ■ 
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The  following  corollary  of  the  ratio  test  is  the  familiar  ratio  rest  from  calculus. 
Corollary  4.3.15  Suppose  thata„  >  0  (n  >  k)  and 

..       an  +  l  , 

lim   =  L. 

Then 

(a)  ^2an  <  oo  if  L  <  1. 

(b)  J2an=ooifL>  1. 

The  test  is  inconclusive  if  L  =  1. 

Example  4.3.12  The  series  *^an  =  ^nrn~l  converges  if  0  <  r  <  1  or  diverges  if 
r  >  1,  since 


a„  nrn~l         \       n J 


so 

lim 


n^oo  a„ 

Corollary  4.3.15  is  inconclusive  if  r  =  1,  but  then  Corollary  4.3.6  implies  that  the  series 
diverges.  ■ 

The  ratio  test  does  not  imply  that  ^  an  <  oo  if  merely 

^77-1-1 

-^ti  <  1  (4.3.14) 
a» 

for  large  n,  since  this  could  occur  with  lim„^oo  an+i/an  =  1,  in  which  case  the  test  is 
inconclusive.  However,  the  next  theorem  shows  that  ^  a„  <  oo  if  (4.3.14)  is  replaced  by 
the  stronger  condition  that 

an+\  <X_P_ 
an    ~  n 

for  some  p  >  1  and  large  n.  It  also  shows  that  ^  an  =  oo  if 

an+\  >  1  _  q_ 
an    ~  n 

for  some  q  <  1  and  large  n . 

Theorem  4.3.16  (Raabe's  Test)  Suppose  that  an  >  0  for  large  n.  Let 

—     / an+\       \                            ( an+\ 
M  =  lim  n  I  1  I     and    m  =  lim  n  |  1 

n^oo     \    a„  J  „^oo 

Then 

(a)  Ys°n  <  °o  ifM  <  —  1. 

(b)  ^an  =  oo  ifm  >  —  1. 
The  test  is  inconclusive  ifm  <  —  1  <  M. 
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Proof    (a)  We  need  the  inequality 

1 


>  1  -px,    x  >  0,  p  >  0.  (4.3.15) 


(1+*)* 

This  follows  from  Taylor's  theorem  (Theorem  2.5.4),  which  implies  that 

1  1    P(P+l)  2 

-  1  —  px  +  ——   X  , 


(l+x)P  '        2  (1  +c)P+2' 

where  0  <  c  <  x.  (Verify.)  Since  the  last  term  is  positive  if  p  >  0,  this  implies  (4.3.15). 
Now  suppose  that  M  <  —p  <  —  1.  Then  there  is  an  integer  k  such  that 


( Cl"+1     i\  ^ 
n  I  1  I  <  —p,    n  >  k, 


so 

a„+i  p 

  <  1  ,  n>k. 

a„  n 


Hence, 


a„+i  1 

<  ;  .    n  >  K, 


a»       (l  +  l/n)P 
as  can  be  seen  by  letting  x  =  l/n  in  (4.3.15).  From  this, 

  <    /  — ,    n  >  k. 

a„       (n  +  \)P I  nP 

Since  ^  \/np  <  oo  if  p  >  1,  Theorem  4.3.13(a)  implies  that  ^an  <  oo. 
(b)  Here  we  need  the  inequality 

(1  -  xf  <  1  -  qx,    0<x<l,    0<q<l.  (4.3.16) 

This  also  follows  from  Taylor's  theorem,  which  implies  that 

(1  -  xf  =  1  -qx  +  q(q  -  1)(1  -  C)""2y , 

where  0  <  c  <  x . 

Now  suppose  that  —  1  <  —  q  <  m.  Then  there  is  an  integer  k  such  that 


n  I  1  I  >  —  q,    n  >  k, 

\  a„  J 


so 

  >  1  ,    n  >k. 

a„  n 

If  q  <  0,  then  ^  an  =  oo,  by  Corollary  4.3.6.  Hence,  we  may  assume  that  0  <  q  <  1,  so 
the  last  inequality  implies  that 
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as  can  be  seen  by  setting  x  =  l/n  in  (4.3.16).  Hence, 

a„+\  III 

  >  —  /  ,    n  >  k. 

Since  ^  \/nq  =  oo  if  q  <  1,  Theorem  4.3.13(b)  implies  that  £]a„  =  oo.  H 
Example  4.3.13  If 

an  =  /  — ;  ;  ,      Q!  >  0, 

^  a  (a  +  l)(a  +  2)  ■  ■  ■  (a  +  n  -  1) 

then 

..     fln+i       ,.      « + 1  , 
hm    =  hm    =  1, 

n-Hjo    a„         n->-oo  a  +  n 

so  the  ratio  test  is  inconclusive.  However, 


lim  n  (  "+1  —  1  )  =  lim  n  \ — —  

«->oo     \   an  J       n^oo     \a  +  n 


n(l  —  a) 
=  lim    =  1  —  a, 

«->oo   a  +  n 

so  Raabe's  test  implies  that  £  a„  <  oo  if  a  >  2  and  ^  fl«  =  oo  if  0  <  a  <  2.  Raabe's 
test  is  inconclusive  if  a  =  2,  but  then  the  series  becomes 


En!       _  ^  1 
(n  +  1)!  ~  ^  n  +  1' 


which  we  know  is  divergent. 

Example  4.3.14  Consider  the  series  ^  a„ ,  where 

(m!)2 


<^2m  — 


a(a  +  1)  •  •  •  (a  +  m)/3(/J  +  1)  •  •  •  OS  +  m) 
and 

(m\)2(m  +  1) 


<22m  +  l  — 


a  (or  +  1)  •  •  •  (a  +  m)/3(0  +  1)  •  •  •  {fi  +  m  +  1)  * 
with  0  <  a  <  p.  Since 

1  j  =  2mP 


and 


2m(a-^±l-l)=2m(  om+l  l) 

\  a2m        J  \P+m  +  l       )        P  +  m+l 

.  .,/a2m+2     ,\  ,  m  +  l         \  (2m+l)a 

(2m  +  1)  1    =  (2m  +  1)  — ■  —  -  1 

V«2m+i       /  \a  +  m  +  l  J 


a  +  m  +  1 


we  have 


lim  n  (   "+1  —  1  ]  =  —2a  and     lim  «  (    "+1  —  1  ]  =  —2/3. 
«^oo    y  a„        y  y  a„  J 

Raabe's  test  implies  that  ^an  <  oo  if  a  >  1/2  and  *^an  =  oo  if  /3  <  1/2.  The  test  is 
inconclusive  ifO<a<l/2</3.  ■ 
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The  next  theorem,  which  will  be  useful  when  we  study  power  series  (Section  4.5),  con- 
cludes our  discussion  of  series  with  nonnegative  terms. 

Theorem  4.3.17  (Cauchy's  Root  Test)  Ifa„  >  Oforn  >  k,  then 

(a)  J2an  <  00  '/'linWoo  a«/n  < 

(b)  J2°n  =  oo  //'lim„_>00  ai'n  >  1. 

— —  \  I  n 

The  test  is  inconclusive  if  \imn^>.00  an     =  1. 

Proof  (a)  If  limn-^oo  alJn  <  1,  there  is  an  r  such  that  0  <  r  <  1  and  a„  <  r  for 
large  n.  Therefore,  an  <  r"  for  large  n.  Since  ^  r"  <  oo,  the  comparison  test  implies  that 

(b)  If  lim„^oo  >  1,  then  a„  >  1  for  infinitely  many  values  of  n,  so  Yl  an  =  oo, 
by  Corollary  4.3.6.  U 


Example  4.3.15  Cauchy's  root  test  is  inconclusive  if 
because  then 

lim  a*'"  =  lim  I  —  I      =  lim  exp  (  log«)  =  1 

for  all  p .  However,  we  know  from  the  integral  test  that  ^\/np  <  oo  if  p  >  1  and 
=ooif><  1. 


Example  4.3.16  If 

£«»  =  £(2+sin^)V, 

then 

—    i/m      —  /  niz\ 

lim  aJ    =  lim  I  2  +  sin  —  )  r  =  3r, 

and  so  <  oo  if  r  <  1/3  and  =  oo  if  r  >  1/3.  The  test  is  inconclusive  if 

r  =  1/3,  but  then  |fl8m+2|  =  1  for  m  >  0,  so     an  =  oo,  by  Corollary  4.3.6. 


Absolute  and  Conditional  Convergence 

We  now  drop  the  assumption  that  the  terms  of  ^an  are  nonnegative  for  large  n.  In  this 
case,  £  an  may  converge  in  two  quite  different  ways.  The  first  is  defined  as  follows. 


Definition  4.3.18  A  series  ^  a„  converges  absolutely,  or  is  absolutely  convergent,  if 
\a„\  <  oo. 
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Example  4.3.17  A  convergent  series  ^  an  of  nonnegative  terms  is  absolutely  conver- 
gent, since  ^an  and  ^  \an\  are  the  same.  More  generally,  any  convergent  series  whose 
terms  are  of  the  same  sign  for  sufficiently  large  n  converges  absolutely  (Exercise  4.3.22). 


Example  4.3.18  Consider  the  series 


Esin«( 
nP 


(4.3.17) 


where  6  is  arbitrary  and  p  >  1 .  Since 


sin  nt 


n  p 


nP 


and  ^  l/np  <  oo  if  /?  >  1,  the  comparison  test  implies  that 


E 


np 


<  oo,     p  >  1. 


Therefore,  (4.3.17)  converges  absolutely  if  p  >  1. 
Example  4.3.19  IfO  <  p  <  1,  then  the  series 


(-1)" 


does  not  converge  absolutely,  since 


E 


(-if 


E- 


OO. 


However,  the  series  converges,  by  the  alternating  series  test,  which  we  prove  below.  ■ 

Any  test  for  convergence  of  a  series  with  nonnegative  terms  can  be  used  to  test  an  arbi- 
trary series  ^  an  for  absolute  convergence  by  applying  it  to  ^  \an\.  We  used  the  compar- 
ison test  this  way  in  Examples  4.3.18  and  4.3.19. 


Example  4.3.20  To  test  the  series 


a(a  +  1)  •••  (a  +  n  —  1) 
for  absolute  convergence,  we  apply  Raabe's  test  to 

n\ 


a  >  0, 


Efl"  =E 


a(a  +  1)  ••  •  (a  +  n  —  1) 


From  Example  4.3.13,  £  |a„  |  <  oo  if  a  >  2  and  £  |a„  |  =  oo  if  a  <  2.  Therefore,  ^  an 
converges  absolutely  if  a  >  2,  but  not  if  a  <  2.  Notice  that  this  does  not  imply  that  ^  an 
diverges  if  a  <  2.  ■ 
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The  proof  of  the  next  theorem  is  analogous  to  the  proof  of  Theorem  3.4.9.  We  leave  it  to 
you  (Exercise  4.3.24). 

Theorem  4.3.19  If  y^qw  converges  absolutely,  then  ^,an  converges. 

For  example,  Theorem  4.3.19  implies  that 

sin«# 


/7p 

converges  if  p  >  1,  since  it  then  converges  absolutely  (Example  4.3.18). 

The  converse  of  Theorem  4.3.19  is  false;  a  series  may  converge  without  converging  abso- 
lutely. We  say  then  that  the  series  converges  conditionally,  or  is  conditionally  convergent; 
thus,  ^(—l)"/«p  converges  conditionally  if  0  <  p  <  1. 

Dirichlet's  Test  for  Series 

Except  for  Theorem  4.3.5  and  Corollary  4.3.6,  the  convergence  tests  we  have  studied  so 
far  apply  only  to  series  whose  terms  have  the  same  sign  for  large  n.  The  following  theo- 
rem does  not  require  this.  It  is  analogous  to  Dirichlet's  test  for  improper  integrals  (Theo- 
rem 3.4.10). 

Theorem  4.3.20  (Dirichlet's  Test  for  Series)  The  series  Y^=kan°n  con- 
verges if 'lim„^oo  an  =  0, 

Wn+i  ~a»\  <  oo,  (4.3.18) 

and 

\bk  +  bk+1+---  +  bn\<M.     n>k,  (4.3.19) 

for  some  constant  M. 

Proof   The  proof  is  similar  to  the  proof  of  Dirichlet's  test  for  integrals.  Define 

Bn  =  bk  +  bk+1-\  \-b„,  n>k 

and  consider  the  partial  sums  of  Y^n=k  an  bn  '■ 

Sn  =  akbk  +  ak+1bk+1  H  h  anbn,    n>k.  (4.3.20) 

By  substituting 

bk  =  Bk    and    bn  =  Bn-Bn-\,  n>k+l, 

into  (4.3.20),  we  obtain 

S„  =  akBk  +  ak+i(Bk+i  —  Bk)  +  •  ■  •  +  a„(B„  —  B„-i), 

which  we  rewrite  as 

Sn  =  (ak  -  ak+i)Bk  +  (ak+i  -  ak+2)Bk+i  H   (4  3  21) 

+         —  an)Bn-i  +  a„B„. 
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(The  procedure  that  led  from  (4.3.20)  to  (4.3.21)  is  called  summation  by  parts.  It  is  analo- 
gous to  integration  by  parts.)  Now  (4.3.21)  can  be  viewed  as 

S„  =  Tn-\  +anBn,  (4.3.22) 

where 

Tn-i  =  (ak  —  ak+i)Bk  +  (flk+i  —  ak+2)Bk+\  +  h  (a«-i  —  an)B„-i; 

that  is,  { Tn }  is  the  sequence  of  partial  sums  of  the  series 

00 

J2(a.i  -aj+i)Bj.  (4.3.23) 
j=k 

Since 

\(aj  -aj+1)Bj\  <  M\aj  -a/+i| 

from  (4.3.19),  the  comparison  test  and  (4.3.18)  imply  that  the  series  (4.3.23)  converges 
absolutely.  Theorem  4.3.19  now  implies  that  {Tn}  converges.  Let  T  =  lim„^oo  Tn.  Since 
{Bn}  is  bounded  and  lim„^oo  an  =  0,  we  infer  from  (4.3.22)  that 

lim  Sn  =  lim  Tn-\  +  lim  anBn  =  T  +  0  =  T. 

n^-oo  n— >oo  n— >co 

Therefore,  ^  anbn  converges.  H 
Example  4.3.21  To  apply  Dirichlet's  test  to 

oo         .  a 

 ,     9  #  kn    (k  =  integer), 

n  +  (-1)"        r  6 

n=2  v  ' 

we  take 

a„  =    and    b„  =  sin  n6. 

n  +  (-1)" 

Then  lim^-xx,  an  =  0,  and 

i  .  3 

\a„+i  -a„  \  <  —  - 

n(n  —  1) 

(verify),  so 

^  |fln+i  -a„\  <  oo. 

Now 

Bn  =  sin  29  +  sin  39  +  ■  ■  ■  +  sin  n  6. 
To  show  that  {Bn}  is  bounded,  we  use  the  trigonometric  identity 

cos  (r  -  4)  9  -  cos  (r  +  4)  9 

sinr<9=  1  U  1  9^2kit. 

2sin(0/2) 
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to  write 

(cos  j9  -  cos  §0)  +  (cos  §0  —  cos  j9)  H  h  (cos  (n  -  |)  0  -  cos(«  +  ±)6») 

"  2  sin(6>/2) 

cos  j9  —  cos(«  +  i)0 
~  2sin(#/2)  ' 

which  implies  that 


\Bn\  < 


1 


«  >  2. 


sin(6>/2) 

Since        and  {bn\  satisfy  the  hypotheses  of  Dirichlet's  theorem,  £  fl/j^n  converges.  ■ 

Dirichlet's  test  takes  a  simpler  form  if  {«„}  is  nonincreasing,  as  follows. 

Corollary  4.3.21  (Abel's  Test)  The  series  ^anbn  converges  if  an+\  <  a„  for 
n  >  k,  lim„^oo  a„  =  0,  and 

\bk  +  bk+i  H  \-b„  \  <  M,  n>k, 

for  some  constant  M. 
Proof   If        <  an,  then 

m  m 

^  \an+i  —  an  \  =  ^(<2„  —  an+i)  =  ak  —  am+i. 

n=k  n=k 

Since  limm^oo  am+i  =  0,  it  follows  that 

oo 

^2  \a„+i  -a„  \  =  ak  <  oo. 

n=k 

Therefore,  the  hypotheses  of  Dirichlet's  test  are  satisfied,  so  ^  anbn  converges.  H 
Example  4.3.22  The  series 

Esin«# 
nP  ' 

which  we  know  is  convergent  if  p  >  1  (Example  4.3.18),  also  converges  if  0  <  p  <  1. 
This  follows  from  Abel's  test,  witha„  =  \/np  and  bn  =  sinn#  (see  Example  4.3.21).  ■ 

The  alternating  series  test  from  calculus  follows  easily  from  Abel's  test. 

Corollary  4.3.22  (Alternating  Series  Test)  The  series  XX- converges 
ifO  <  an+i  <  a„  andlimn^oo  an  =  0. 
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Proof  Let  bn  =  (—1)";  then  {|5„|}  is  a  sequence  of  zeros  and  ones  and  therefore 
bounded.  The  conclusion  now  follows  from  Abel's  test.  H 

Grouping  Terms  in  a  Series 

The  terms  of  a  finite  sum  can  be  grouped  by  inserting  parentheses  arbitrarily.  For  example, 

(1  +  7)  +  (6  +  5)  +  4  =  (1  +  7  +  6)  +  (5  +  4)  =  (1  +  7)  +  (6  +  5  +  4). 

According  to  the  next  theorem,  the  same  is  true  of  an  infinite  series  that  converges  or 
diverges  to  ±oo. 

Theorem  4.3.23  Suppose  that  J2T=k  an  =  A,  where  — oo  <  A  <  oo.  Let  {rij}™  be 
an  increasing  sequence  of  integers,  with  n\  >k.  Define 

bi  =  ak  H  h  anx, 

bi  =  ani+i  H  h  a„2, 


Then 


br  =  ««,._, +i  +  h  a„r 

oo 


j=l 

Proof  If  Tr  is  the  rth  partial  sum  of  J^JLi  and  {An}  is  the  «th  partial  sum  of 
T,T=k  as,thea 

Tr  =  b1+b2  +  ---  +  br 

=  («i  H  h  ««,)  +  (a„l+1  H  h  a„2)  H  h  (a„,._,+i  H  h  a„r) 

=  Anr . 

Thus,  {Tr}  is  a  subsequence  of  {An},  so  linv^oo  Tr  =  lim„^oo  An  =  A  by  Theorem  4.2.2. 
TJ 

Example  4.3.23  If  YlT=o(~ satisfies  the  hypotheses  of  the  alternating  series 
test  and  converges  to  the  sum  S,  Theorem  4.3.23  enables  us  to  write 


S  =  £(-l)na„  +  (~l)k+1  X>*+2;-i  -fl*+2y) 


and 


n=0 
k 


y'=i 


S  =  £(-l)*a„  + 


ak  +  \  -  -  Oi:+2/-l) 

./  =  1 


Since  0  <         <      these  two  equations  imply  that  S—Sk  is  between  0  and  (—  \)k  lQk+\ ■ 
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Example  4.3.24  Introducing  parentheses  in  some  divergent  series  can  yield  seem- 
ingly contradictory  results.  For  example,  it  is  tempting  to  write 

oo 

J2(-l)n+1  =  (1  -  1)  +  (1  -  1)  +  •  •  •  =  0  +  0  +  •  •  ■ 

n=l 

and  conclude  that  ^^Li(— 1)"  =  0,  but  equally  tempting  to  write 

oo 

=  i  —  (i  —  i)  —  ci  —  l)  

=  1-0-0  

and  conclude  that  5Z5i=i(— 1)"+1  =  1.  Of  course,  there  is  no  contradiction  here,  since 
Theorem  4.3.23  does  not  apply  to  this  series,  and  neither  of  these  operations  is  legitimate. 

Rearrangement  of  Series 

A  finite  sum  is  not  changed  by  rearranging  its  terms;  thus, 

1  +  3  +  7=1  +  7+3  =  3+1  +  7=  3  +  7+1  =  7+1  +  3  =  7+3+1. 

This  is  not  true  of  all  infinite  series.  Let  us  say  that  £  bn  is  a  rearrangement  of  ^  an  if 
the  two  series  have  the  same  terms,  written  in  possibly  different  orders.  Since  the  partial 
sums  of  the  two  series  may  form  entirely  different  sequences,  there  is  no  apparent  reason 
to  expect  them  to  exhibit  the  same  convergence  properties,  and  in  general  they  do  not. 

We  are  interested  in  what  happens  if  we  rearrange  the  terms  of  a  convergent  series.  We 
will  see  that  every  rearrangement  of  an  absolutely  convergent  series  has  the  same  sum,  but 
that  conditionally  convergent  series  fail,  spectacularly,  to  have  this  property. 

Theorem  4.3.24  If  5Zn=j  bn  is  a  rearrangement  of  an  absolutely  convergent  series 
~Y^n=i  an<  then  Y^=i  bn  also  converges  absolutely,  and  to  the  same  sum. 

Proof  Let 

A„  =  |ai|  +  \a2\  +  ■■■+  \a„\    and    Bn  =  \bx\  +  \b2\  +  ■  ■  ■  +  \bn\. 

For  each  n  >  1,  there  is  an  integer  kn  such  that  b\,  b%, . . .,  b„  are  included  among  a\,  a2, 
afCn,  so  Bn  <  Akn.  Since  {An}  is  bounded,  so  is  {Bn},  and  therefore  ^  \bn\  <  oo 
(Theorem  4.3.8). 

Now  let 

An  =  ai  +  a2  -\  h  an,     Bn  =  b\  +  b2  H  h  b„, 

oo  oo 

A  =  ^2an,    and    B  =  ^  bn . 

n=l  n=l 
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We  must  show  that  A  =  B.  Suppose  that  e  >  0.  From  Cauchy's  convergence  criterion  for 
series  and  the  absolute  convergence  of  £  a„ ,  there  is  an  integer  N  such  that 

Wn+i]  +  Wn+2\ H  r-|atf+*l<e>    fc  >  1. 

Choose  N\  so  that  ai,  a%,  ■  ■     o-n  are  included  among  b\,  b2,        fejvi-  If  «  >  A^i, 

then      and  Bn  both  include  the  terms  a\,  a%  cin >  which  cancel  on  subtraction;  thus, 

\An  —  B„  \  is  dominated  by  the  sum  of  the  absolute  values  of  finitely  many  terms  from  ^  an 
with  subscripts  greater  than  N.  Since  every  such  sum  is  less  than  e, 


\A„  -  B„\  <  e    if   n  >  Nt. 

Therefore,  lim„^oo(^4„  —  Bn)  =  0  and  A  =  B.  H 

To  investigate  the  consequences  of  rearranging  a  conditionally  convergent  series,  we 
need  the  next  theorem,  which  is  itself  important. 

Theorem  4.3.25  If  P  =  {ani  }^°  and  Q  =  {am,}j°  are  respectively  the  subsequences 
of  all  positive  and  negative  terms  in  a  conditionally  convergent  series  ^  a„ ,  then 

oo  oo 

^2,a»i  =  00  and  ~^jamj  = —oo.  (4.3.24) 
(=1  j=l 

Proof  If  both  series  in  (4.3.24)  converge,  then  ^  a„  converges  absolutely,  while  if  one 
converges  and  the  other  diverges,  then  ^  an  diverges  to  oo  or  — oo.  Hence,  both  must 
diverge.  H 

The  next  theorem  implies  that  a  conditionally  convergent  series  can  be  rearranged  to 
produce  a  series  that  converges  to  any  given  number,  diverges  to  ±oo,  or  oscillates. 

Theorem  4.3.26  Suppose  that  Yl^Li  an  is  conditionally  convergent  and  fi  and  v  are 
arbitrarily  given  in  the  extended  reals,  with  \i  <  v.  Then  the  terms  of  YlnLi  a"  can  be 
rearranged  to  form  a  series  Yln^=i  bn  with  partial  sums 

B„  =b1+b2  +  ---  +  b„,  n>\, 

such  that 

lim  Bn  =  v    and     lim  Bn  =  fi.  (4.3.25) 


n— >oo 


Proof  We  consider  the  case  where  n  and  v  are  finite  and  leave  the  other  cases  to  you 
(Exercise  4.3.36).  We  may  ignore  any  zero  terms  that  occur  in  ~Y^n=\  a"-  F°r  convenience, 
we  denote  the  positive  terms  by  P  =  {a,  and  and  the  negative  terms  by  Q  =  {—Rj  }^°. 
We  construct  the  sequence 

{bn)T  =  Wl,  ■  ■  ■  .Otmi,-Pl-  ■  ■  ■  ,-Pn,,0(mi+1,  .  .  .  ,am2,-P„1  +  1,  .  .  .  ,-fi„2,  .  .  .}, 

(4.3.26) 
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with  segments  chosen  alternately  from  P  and  Q.  Let  Wo  =  no  =  0.  If  k  >  1,  let  and 
«k  be  the  smallest  integers  such  that  mj  >  m/t_i,  «/t  >  »jt-i, 

mk  nk_i  mk  nk 

X  at  ~  X   - v>  and  X! a''  ~  X  fa  -  f1- 

1=1  y'  =  l  i  =  l  y'  =  l 

Theorem  4.3.25  implies  that  this  construction  is  possible:  since  ^a,-  =  =  oo,  we 

can  choose  mk  and  nk  so  that 

X    a'    and      X  Pi 

i=™k-\  j=nk-{ 

are  as  large  as  we  please,  no  matter  how  large  mk-\  and  nk_\  are  (Exercise  4.3.23).  Since 
mk  and  nk  are  the  smallest  integers  with  the  specified  properties, 

v  <  J»m*+»*_i  <  v  +           £  >  2,  (4.3.27) 

and 

-       <  Bmk+nk  <p,    k>2.  (4.3.28) 
From  (4.3.26),  b„  <  0  if  mk  +  nk—\  <  n  <  mk  +  nk,  so 

Bmk+nk  <  B„  <  Bmk+„k_1,    mk  +  nk-\  <  n  <  mk  +  nk,  (4.3.29) 
while  bn  >  0  if  mk  +  nk  <  n  <  mk+\  +  nk,  so 

Bmk+nk  <  B„  <  Bmk+1+„k,    mk  +  nk  <n  <  mk+1  +  nk.  (4.3.30) 
Because  of  (4.3.27)  and  (4.3.28),  (4.3.29)  and  (4.3.30)  imply  that 

M  -  Pnk  <  Bn  <  v  +  amk,    mk  +  nk-\  <  n  <  mk  +  nk,  (4.3.31) 

and 

H  -  Pnk  <  B„  <  v  +  amk+1 ,    mk  +  nk  <n  <mk+i+nk,  (4.3.32) 

From  the  first  inequality  of  (4.3.27),  Bn  >  v  for  infinitely  many  values  of  n.  However, 
since  lim/^oo  a,-  =  0,  the  second  inequalities  in  (4.3.31)  and  (4.3.32)  imply  that  if  e  >  0 
then  Bn  >  v  +  e  for  only  finitely  many  values  of  n.  Therefore,  lim„^oo  Bn  =  v.  From 
the  second  inequality  in  (4.3.28),  Bn  <  [i  for  infinitely  many  values  of  n.  However,  since 
limy^oo  Pj  =  0,  the  first  inequalities  in  (4.3.31)  and  (4.3.32)  imply  that  if  e  >  0  then 
Bn  <  pi  —  e  for  only  finitely  many  values  of  n.  Therefore,  lim„^^  Bn  =  fi.  H 

Multiplication  of  Series 

The  product  of  two  finite  sums  can  be  written  as  another  finite  sum:  for  example, 

(a0  +  a\  +  a2)(b0  +  b\  +  b2)  =  a0b0  +  a0b\  +  a0b2 

+a\bo  +  a\b\  +  a\b2 
+a2b0  +  a2b\  +  a2b2, 
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where  the  sum  on  the  right  contains  each  product  djbj  (i,  j  =  0,  1,  2)  exactly  once.  These 
products  can  be  rearranged  arbitrarily  without  changing  their  sum.  The  corresponding 
situation  for  series  is  more  complicated. 

Given  two  series 


^  a„    and    ^  b„ 

n=0  n=0 


(because  of  applications  in  Section  4.5,  it  is  convenient  here  to  start  the  summation  index 
at  zero),  we  can  arrange  all  possible  products  a\bj  (z,  j  >  0)  in  a  two-dimensional  array: 


a0b0  a0bi  a0b2  a0b3  ■■■ 

a\bo  ci\b\  ci\b2  a\b3  ■■■ 

ciibo  a2b\  a2b2  a2b3  ■■■  (4.3.33) 

a3b0  a3bi  a3b2  a3b3  ■■■ 


where  the  subscript  on  a  is  constant  in  each  row  and  the  subscript  on  b  is  constant  in  each 
column.  Any  sensible  definition  of  the  product 


5>*  X> 

\«=0      /    \«=0  / 


clearly  must  involve  every  product  in  this  array  exactly  once;  thus,  we  might  define  the 
product  of  the  two  series  to  be  the  series  ^^L0  pn,  where  {pn}  is  a  sequence  obtained 
by  ordering  the  products  in  (4.3.33)  according  to  some  method  that  chooses  every  product 
exactly  once.  One  way  to  do  this  is  indicated  by 


a0b0 

-»•  a0bi 

a0b2 

I 

t 

I 

a\b0 

a-xbi 

aib2 

a\b3 

i 

t 

i 

a2b0 

a2bi 

a2b2 

a2b3 

I 

a-ibo 

<r-  a3bi 

<-  a^b2 

I 

(4.3.34) 
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and  another  by 


a0b0 

/ 

a0bi 

/ 

a0b2 

/ 

a0b3 

a\bo 

ci\b\ 

a\b2 

a\b3 

I 

/ 

/ 

/ 

a2b0 

/ 

a2bi 

/ 

a2b2 

a2b3 

a3b0 

a3b\ 

a3b2 

a3b3 

a0b4 


(4.3.35) 


I  / 

a4b0  :  :  : 

There  are  infinitely  many  others,  and  to  each  corresponds  a  series  that  we  might  consider 
to  be  the  product  of  the  given  series.  This  raises  a  question:  If 


^2  a„  =  A    and     ybn  =  B 


where  A  and  B  are  finite,  does  every  product  series  ^^L0  pn  constructed  by  ordering  the 
products  in  (4.3.33)  converge  to  AB1 

The  next  theorem  tells  us  when  the  answer  is  yes. 
Theorem  4.3.27  Let 

oo  oo 

a„  =  A    and    ^2^n  ~  ^' 

n=0  n=0 

where  A  and  B  are  finite,  and  at  least  one  term  of  each  series  is  nonzero.  Then  Yl^Lo  Pn 
1  H  for  every  sequence  {pn}  obtained  by  ordering  the  products  in  (4.3.33)  if  and  only  if 
On  and     bn  converge  absolutely.  Moreover,  in  this  case,  ^  p„  converges  absolutely. 

Proof  First,  let  {pn }  be  the  sequence  obtained  by  arranging  the  products  {a,-  bj }  accord- 
ing to  the  scheme  indicated  in  (4.3.34),  and  define 

An  =  a0  +  a\  H  h  an,     An  =  \a0\  +  |«i |  H  h  \a„\, 

B„  =b0  +  b1+---  +  bn,     ~B„  =  \b0\  +  \bi\  +  ■■■+  \bn\, 

Pn  =  PO  +  Pi  H  VPn,     Pn  =  \Po\  +  \pi  \  H  h  \Pn\- 

From  (4.3.34),  we  see  that 

P0  =  A0B0,     P3  =  A1B1.     P8  =  A2B2, 

and,  in  general, 

P(m+i)2-i  =  AmBm.  (4.3.36) 
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Similarly, 

7{m+1)2_1=AmBm.  (4.3.37) 

lfJ2\an\  <  oo  and  J2  \b 

n\  <  oo,  then  {Am Bm }  is  bounded  and,  since  Pm  <  P (m_j_i)2_i, 
(4.3.37)  implies  that  {Pm}  is  bounded.  Therefore,  ^  \p„  \  <  oo,  so  ^  /?„  converges.  Now 

oo 

Epn  =  lim  P„  (by  definition) 


n=0 

m—t-oo 


=  lim  P(m+1)2_i  (by  Theorem  4.2.2) 

lim  ^m5m  (from  (4.3.36)) 

(  lim  4m)  (  lim  Bm)  (by  Theorem  4.1.8) 

\m—>oo       /  \m—>OQ  / 


=  AB. 

Since  any  other  ordering  of  the  products  in  (4.3.33)  produces  a  a  rearrangement  of  the 
absolutely  convergent  series  ^^L0  pn,  Theorem  4.3.24  implies  that  £  \qn  |  <  oo  for  every 
such  ordering  and  that  ^^L0  In  =  AB,  This  shows  that  the  stated  condition  is  sufficient. 

For  necessity,  again  let  £2^L0  Pn  be  obtained  from  the  ordering  indicated  in  (4.3.34), 
and  suppose  that  ^^L0  pn  and  all  its  rearrangements  converge  to  AB.  Then  £  />„  must 
converge  absolutely,  by  Theorem  4.3.26.  Therefore,  {/>m2_1}  is  bounded,  and  (4.3.37) 
implies  that  {Am}  and  {Bm}  are  bounded.  (Here  we  need  the  assumption  that  neither  ^  an 
nor  52  bn  consists  entirely  of  zeros.  Why?)  Therefore,  £2  \an\  <  oo  and  J2\bn\  <  oo.  TJ 

The  following  definition  of  the  product  of  two  series  is  due  to  Cauchy.  We  will  see  the 
importance  of  this  definition  in  Section  4.5. 

Definition  4.3.28  The  Cauchy  product  of^^Lo  <*n  and  Yl™=o  bn  is  Y^n=o  c«'  where 

cn  =  aobn  +  a\bn-\  H  1-  an-\b\  +  anb0.  (4.3.38) 

Thus,  cn  is  the  sum  of  all  products  atbj,  where  i  >  0,  j  >  0,  and  i  +  j  =  n;  thus, 

n  n 

cn  =  ^2,arbn-r  =  ^2,bran-r.  (4.3.39) 


Henceforth,  {J^=0  an)  (Z2«^=o  bn)  should  be  interpreted  as  the  Cauchy  product.  Notice 
that 

it  A  (f>»)=(x>»)  (x>l 

\n=0      /    \n=0      /         \n=0      /   \n=0  / 

and  that  the  Cauchy  product  of  two  series  is  defined  even  if  one  or  both  diverge.  In  the  case 
where  both  converge,  it  is  natural  to  inquire  about  the  relationship  between  the  product  of 
their  sums  and  the  sum  of  the  Cauchy  product.  Theorem  4.3.27  yields  a  partial  answer  to 
this  question,  as  follows. 
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Theorem  4.3.29  If'Yl'n'=oan  and  YlnLo^n  converge  absolutely  to  sums  A  and  B, 
then  the  Cauchy  product  ofY^Lo  O-n  cind  ^^L0      converges  absolutely  to  AB. 

Proof   Let  Cn  be  the  nth  partial  sum  of  the  Cauchy  product;  that  is, 

C„  =  C0  +  Ci  H  h  Cn 

(see  (4.3.38)).  Let  Yl™=o  Pn  be  tne  series  obtained  by  ordering  the  products  {at,  bj}  ac- 
cording to  the  scheme  indicated  in  (4.3.35),  and  define  Pn  to  be  its  «th  partial  sum;  thus, 

Pn  =  Po  +  Pi  H  V  Pn- 

Inspection  of  (4.3.35)  shows  that  cn  is  the  sum  of  the  n  +  1  terms  connected  by  the  diagonal 
arrows.  Therefore,  Cn  =  Pm„ ,  where 

n(n  +  3) 

m„  =  1  +  2  +  •  •  •  +  (n  +  1)  -  1  =  — - — -. 

From  Theorem  4.3.27,  lim„^oo  Pmn  =  AB,  so  limn^.oo  C„  =  AB.  To  see  that  ^2  \cn\  < 
oo,  we  observe  that 

n  m„ 

X>-i  <X>*i 

and  recall  that  ^  \ps\  <  oo,  from  Theorem  4.3.27.  H 

Example  4.3.25  Consider  the  Cauchy  product  of  ^^L0  r"  with  itself.  Here  an  = 
b„  =  r"  and  (4.3.39)  yields 

c„  =  r°rn  +  rlr"-1  +■■■  +  r^V1  +  r"r°  =  (n  +  l)r", 

so 

(oo        \  *  oo 
n=0      )  n=0 

Since 

^  1 

\r\  <  1, 


yr"  =  J-. 

i_r 


n=0 

and  the  convergence  is  absolute,  Theorem  4.3.29  implies  that 


n=0 

Example  4.3.26  If 


oo 


Ea"  =  E-^-    and  E^E-^p 

«  =  0  71  =  0  77=0  77=0 
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then  (4.3.39)  yields 


-       gn-mpm     =  J_  y      n  \  =  («  +  fif 

"      ^— '  (n—m)\m\      n\  ^— '  \m/  «! 
m=o  v         7  m=o  \  / 


thus, 


oo 


You  probably  know  from  calculus  that  ^^10  xn  jn\  converges  absolutely  for  all  x  to  ex . 
Thus,  (4.3.40)  implies  that 


e*eft  =e<x+f} 


a  familiar  result. 


The  Cauchy  product  of  two  series  may  converge  under  conditions  weaker  than  those 
of  Theorem  4.3.29.  If  one  series  converges  absolutely  and  the  other  converges  condi- 
tionally, the  Cauchy  product  of  the  two  series  converges  to  the  product  of  the  two  sums 
(Exercise  4.3.40).  If  two  series  and  their  Cauchy  product  all  converge,  then  the  sum  of 
the  Cauchy  product  equals  the  product  of  the  sums  of  the  two  series  (Exercise  4.5.32). 
However,  the  next  example  shows  that  the  Cauchy  product  of  two  conditionally  convergent 
series  may  diverge. 

Example  4.3.27  If 

a„  =b„  = 

s/n  +  l 

then  ^^L0  an  and  ^^L0  bn  converge  conditionally.  From  (4.3.39),  the  general  term  of 
their  Cauchy  product  is 

c    _    "     (_l)r  +  l(_ir-r  +  l  "  1  1 

"     f^o  V^TT^n  -r+1  f^Q  v^TT  V«  -  r  +  V 

so 

A      1  1  _"+!_, 

Therefore,  the  Cauchy  product  diverges,  by  Corollary  4.3.6. 


4.3  Exercises 


1.  Prove  Theorem  4.3.2. 

2 .  Prove  Theorem  4 . 3 . 3 . 

3.  (a)    Prove:  If  an  =  bn  except  for  finitely  many  values  of  «,  then  ^  an  and  £  bn 

converge  or  diverge  together. 
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(b)   Let  b„k  =  cik  for  some  increasing  sequence  {n^}™  of  positive  integers,  and 
bn  =  0  if  n  is  any  other  positive  integer.  Show  that 

oo  oo 

/,  b„    and     ^  a„ 
«=i  «=i 

diverge  or  converge  together,  and  that  in  the  latter  case  they  have  the  same  sum. 
(Thus,  the  convergence  properties  of  a  series  are  not  changed  by  inserting  zeros 
between  its  terms.) 

4.  (a)    Prove:  If  ^  an  converges,  then 

lim  (an  +  an+i  H  h  an+r)  =  0,     r  >  0. 

n— >oo 

(b)   Does  (a)  imply  that     an  converges?  Give  a  reason  for  your  answer. 

5.  Prove  Corollary  4.3.7. 

6.  (a)    Verify  Corollary  4.3.7  for  the  convergent  series  ^  \/np  (p  >  1).  Hint:  See 

the  proof  of  Theorem  4.3.10. 
(b)    Verify  Corollary  4.3.7  for  the  convergent  series        1)" /n. 

7.  Prove:  If  0  <  bn  <  an  <  bn+\ ,  then     an  and  ^  bn  converge  or  diverge  together. 

8.  Determine  convergence  or  divergence. 


(a)V^2  (b)V  '  

v  '  ^  V^rpT  V  '  ^«2[l  +  ±sin(«:r 


/4)] 


(c)  £  „ 

(e)  Esin^ 

(g)  E  -  cot  - 

me1*" 

Suppose  that  / (x)  >  0  for  x  >  k.  Prove  that  J",    / (x)  dx  <  oo  if  and  only  if 

/    / (x)   <  00. 

Hint:  Use  Theorems  3.4.5  cmd 4.3.8. 
10.     Use  the  integral  test  to  find  all  values  of  p  for  which  the  series  converges. 

(  \  n  /,  \  ^      "2  /  \  smnw 

^  ^  (n2  -  l)P  ^  '  ^  («3  +  4)p  ^  ^  (coshn)^ 
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11.     Let  Ln  be  the  nth  iterated  logarithm.  Show  that 

1 


L0(«)Li (n)  ■  ■  ■  Lk(n)  [Lk+l  (n)]p 
converges  if  and  only  if  p  >  1.  Hint:  See  Exercise  3.4.10. 

12.  Suppose  that  g,  g',  and  (g')2  —  gg"  are  all  positive  on  [R,  oo).  Show  that 

v-*  g'(n) 

if  and  only  if  limx^oo  g(x)  <  oo. 

13.  Let 

oo 

5^  =  £^  ^>L 

«=1 

Show  that 

N 


1  <  s^)  -  y  —  < 

AT  +  l)^"1  ^  nP 


(p  -  \)(N  +  \)p~1        v^  O-l)^-1' 

HINT:  See  the  proof  of  Theorem  4.3.10. 
14.     Suppose  that  /  is  positive,  decreasing,  and  locally  integrable  on  [1,  oo],  and  let 


»  rn 

a„  =  J2         _  Jj   f(x)  dx- 


k=l 

(a)  Show  that  {an }  is  nonincreasing  and  nonnegative,  and 

0  <  lim  a„  <  f(l). 

n— >oo 

(b)  Deduce  from(a)  that 

/ll  1  \ 

y  =  hm    1  +  -  +  H  1  log« 

n^oo  \       2      3  n  ) 

exists,  and  0  <  y  <  1.  (y  is  Euler's  constant;  y  0.577.) 
15.     Determine  convergence  or  divergence. 

(a)  >    —   (b)  >   r  (r  >  0) 

(c)  Y,e~nP  ™&«P  (P  >  0)  (d)  J]  "+l0g" 

,    .  «  +  lQg«  ,   .  (1  +  !/«)" 

^  '  ^  «2log«  2" 
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16.     Let  Ln  be  the  nth  iterated  logarithm.  Prove  that 

1 


E 


[L0(»)P+1  [Li(«)]"+1  ■■■[Lm(n)]9m+1 


converges  if  and  only  if  there  is  at  least  one  nonzero  number  in  {qo,  q\ , . . . ,  qm }  and 
the  first  such  is  positive.  Hint:  See  Exercises  4.3.1 1  and  2AA2(b). 

17.  Determine  convergence  or  divergence. 

,  .  ^  2  +  sin2(«7r/4)  ,  .  ^  n(n  +  1) 

*  '  '  ■  2n  '  4" 

/  s       3  -  sin(«jr/2)  .  .       n  +  (-1)" 

^  '  ^     n(n  +  1)  ^  '  ^  «(«  +  1) 

18.  Determine  convergence  or  divergence,  with  r  >  0. 

(a)E^  (b)£»V  (c)E^ 

r2«  +  l  „2« 


WEt^txw  (e)E 


(2«  +  1)!  ^ — '  (2«)! 

19.  Determine  convergence  or  divergence. 

(2«)!   (3n)!  

la'^22»(/i!)2  11  '  L  33"«!(«  +  1)!(«  +  3)! 

^         2"n!  y-^  a(a  +  I)  ■  ■  ■  (a  +n-l) 

(c)E5...7.(2n  +  3)    WE^  +  d-^^d  <«.*>°> 

20.  Determine  convergence  or  divergence. 

(c)D,1  +  1)(i±~)"     «!)£(, -if 

21.  Give  counterexamples  showing  that  the  following  statements  are  false  unless  it  is 
assumed  that  the  terms  of  the  series  have  the  same  sign  for  n  sufficiently  large. 

(a)  an  converges  if  its  partial  sums  are  bounded. 

(b)  If  bn  y£  0  for  n  >  k  and  liirin-xx,  an/bn  =  L,  where  0  <  L  <  oo,  then  ^  an 
and     bn  converge  or  diverge  together. 

(c)  If  a„  ^  0  and  lim„^.oo  an+i/an  <  1,  then  ^  an  converges. 

(d)  If  a„     0  and  lim„^oo  n  [(an+i/a„)  -  1]  <  — 1,  then  J2an  converges. 

22.  Prove:  If  the  terms  of  a  convergent  series  ^  an  have  the  same  sign  for  n  >  k,  then 
J2  an  converges  absolutely. 

23.  Suppose  that  an  >  0  for n  >  m  and  ^an  =  oo.  Prove:  If  N  is  an  arbitrary  integer 
>  m  and  /  is  an  arbitrary  positive  number,  then  ~Ylm=N  an  >  J  for  some  positive 
integer  k . 

24.  Prove  Theorem  4.3.19. 
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25.     Show  that  the  series  converges  absolutely. 


cos  nd 


V«3  -  1 

26.  Show  that  the  series  converges. 

(a)  Z,  „2  +      (-°°  < 0  <  °°)  (b)  Z  — ^—  (0  ^  2kn> k  =  inteser) 

27.  Determine  whether  the  series  is  absolutely  convergent,  conditionally  convergent,  or 
divergent. 

(a)  ^  —j=  (t>4m  =  t>4m  +  i  =  1,  &4m+2  =  £>4m+3  =  —1) 
.     .   .r-^  I     .     nn  ,    .   .r-^    1  «7T 

(b)  >    -sin —  (c)  >    —cos  — 

l-3-5---(2n  +  l)  . 

(a)  >   sinnft 

v  '  ^  4  •  6  •  8  •  •  •  (2n  +  4) 

28.  Let  g  be  a  rational  function  (ratio  of  two  polynomials).  Show  that  ^  g{n)rn  con- 
verges absolutely  if  \r\  <  1  or  diverges  if  \r\  >  1.  Discuss  the  possibilities  for 
|r|  =  l. 

29.  Prove:  If     a„  <  oo  and  J]  £2  <  oo,  then     a„6„  converges  absolutely. 

30.  (a)   Prove:  If     an  converges  and     a„  =  oo,  then  J]  a„  converges  condition- 

ally. 

(b)  Give  an  example  of  a  series  with  the  properties  described  in  (a). 

31.  Suppose  that  0  <  an+\  <  an  and 

,.     bi  +b2  +  ---  +  b„ 

hm    >  0, 

n— >oo  Wn 

where  {w„}  is  a  sequence  of  positive  numbers  such  that 

^2wn(an  -a„+i)  =  oo. 

Show  that  ^2anbn  =  oo.  Hint:  Use  summation  by  parts. 

32.  (a)   Prove:  If  0  <  2e  <  6  <  jt  -  2e,  then 

I  sin  9 1  +  I  sin  201  H  \-\sinn6\  sine 

hm    >   . 

n— >oo  1  2 

Hint:  STiow  f/;af  |  sinn#|  >  sine  at  least  "half  the  time";  more  precisely, 
show  that  if\  sinm#|  <  sine  for  some  integer  m  then  |  sin(w  +  \)9\  >  sine. 
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(b)    Show  that 

Esin  n8 
nP 

converges  conditionally  if  0  <  p  <  1  and  6  ^  kn  (k  =  integer).  Hint:  Use 
Exercise  4.3.31  and  see  Example  4.3.22. 

33.     Show  that 

yj+i 


-  _  i  -   i 

^-f      n       ~  2  2-i  n(2n 


n  =  l 


2  ^  n(2n  -  1) 

n  =  l 


34.     Let  &3m+i,  bim+2  =  —2,  and  bj,m+j,  =  1  for  m  >  0.  Show  that 


bn       2    ~  1 


f=f  »      3  (, 


n  (m  +  l)(3w  +  l)(3m  +  2) 

n=l  m=0  v  v  ' 

35.  Let  ^2b„  be  obtained  by  rearranging  finitely  many  terms  of  a  convergent  series 

O-n .  Show  that  the  two  series  have  the  same  sum. 

36.  Prove  Theorem  4.3.26  for  the  case  where  (a)  /i  is  finite  and  v  =  oo;  (b)  p,  =  — oo 
and  v  =  oo;  (c)  fi  =  v  =  oo. 

37.  Give  necessary  and  sufficient  conditions  for  a  divergent  series  to  have  a  convergent 
rearrangement. 

38.  A  series  diverges  unconditionally  to  oo  if  every  rearrangement  of  the  series  diverges 
to  oo.  State  necessary  and  sufficient  conditions  for  a  series  to  have  this  property. 

39.  Suppose  that  /  and  g  have  derivatives  of  all  orders  at  0,  and  let  h  =  fg.  Show 
formally  that 

\n=0  '  /    \n=0         '  /  n=0 

in  the  sense  of  the  Cauchy  product.  Hint:  See  Exercise  2.3.12. 

40.  Prove:  If     \a„  |  <  oo  and  ^  bn  converges  (perhaps  conditionally),  with  YnLo  an  = 
A  and  ^^10  bn  =  B,  then  the  Cauchy  product 


oo  /  oo       \    /  oo  ^ 

n=0  \n=0      I    \n=0  ) 


converges  to  AB.  HINT:  Let  {An},  {Bn},  and  {C„}  be  the  partial  sums  of  the  series. 
Show  that 


Cn  —  AnB  =  ^^ar(Bn-r  —  B) 


r=0 


and  apply  Theorem  4.3.5  to  Y  \o-n  I- 
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41.  Suppose  that  ar  >  0  for  all  r  >  0  and  and  Y^o*  ar  =  A  <  oo.  Show  that 

j  n— l  -  «— l 

lim  —  >    <3r+s  =  0    and      lim  —  >    ar_,s  =  2A  —  ao. 

n— >oo  ;7  ^— '  n  — *oa  n  ^— ' 

r,j=0  r,i=0 

42.  Prove:  If  lim^oo  cr?  =  aj  (j  >  1)  and  |fl;  |  <  07  (7,  y  >  1),  where  YlJ^i  ai  < 
00,  then  lim^oo  J^T=i  aj   =  T,T=i  a J  • 

43.  Prove:  If  an  >  0,  n  >  1,  and  J^^Li      =  °°> tnen  2Zn^=i  an/(l  +  an)  =  oo- 


4.4  SEQUENCES  AND  SERIES  OF  FUNCTIONS 


Until  now  we  have  considered  sequences  and  series  of  constants.  Now  we  turn  our  attention 
to  sequences  and  series  of  real-valued  functions  defined  on  subsets  of  the  reals.  Throughout 
this  section,  "subset"  means  "nonempty  subset." 

If  Fk,  Fk+i,  Fn, . . .  are  real-valued  functions  defined  on  a  subset  D  of  the  reals, 
we  say  that  {Fn}  is  an  infinite  sequence  or  (simply  a  sequence)  of  functions  on  D .  If  the 
sequence  of  values  {Fn  (x)}  converges  for  each  x  in  some  subset  S  of  D,  then  {Fn}  defines 
a  limit  function  on  5.  The  formal  definition  is  as  follows. 

Definition  4.4.1  Suppose  that  {Fn}  is  a  sequence  of  functions  on  D  and  the  sequence 
of  values  {Fn(x}}  converges  for  each  x  in  some  subset  S  of  D.  Then  we  say  that  {Fn} 
converges  pointwise  on  S  to  the  limit  function  F,  defined  by 

F(x)  =  lim  F„(x),     x  €  S. 

n— >oo 

Example  4.4.1  The  functions 

(        nx  \ 
define  a  sequence  on  D  =  (—00,  1],  and 


h/2 

,    n  >  1, 


{00,  x  <  0, 
1,  x  =  0, 
0,     0  <  x  <  l. 

Therefore,  {Fn}  converges  pointwise  on  5  =  [0,  l]  to  the  limit  function  F  defined  by 


F{x) 


l,    x  =  0, 
0,    0  <  x  <  l. 


Example  4.4.2  Consider  the  functions 

Fn(x)  =  xne~nx,    x>0,  n>\, 


(Figure  4.4. 1). 
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y  =  e 


Figure  4.4.1 


Equating  the  derivative 


F'n(x)  =  nx 


77  —  1  —nx 


(l-x) 


to  zero  shows  that  the  maximum  value  of  Fn{x)  on  [0,  oo)  is  e  ",  attained  at  x  =  1. 
Therefore, 

\F„(x)\<e-",  x>0, 

so  liirin^oo  Fn(x)  =  0  for  all  x  >  0.  The  limit  function  in  this  case  is  identically  zero  on 
[0,oo). 

Example  4.4.3  For«  >  1,  let  F„  be  defined  on  (— oo,  oo)  by 


fO, 


Fn(x)  = 


-n(2  +  nx). 


2 

n  x, 


n{2  —  nx), 
0, 


x  <  -J, 


-2  <x<-i, 

n  —  n  ' 


-i<x<i. 


(Figure  4.4.2,  page  236), 

Since  F„(0)  =  0  for  all  n,  lim„^oo  F„(0)  =  0.  If  x  ±  0,  then  F„(jc)  =  0  if »  >  2/|jc|. 
Therefore, 

lim  F„(x)  =  0.     — oo  <  x  <  oo, 

n— >oo 

so  the  limit  function  is  identically  zero  on  (— oo,  oo). 

Example  4.4.4  For  each  positive  integer  n,  let  Sn  be  the  set  of  numbers  of  the  form 
x  =  p/q,  where  p  and  q  are  integers  with  no  common  factors  and  1  <  q  <  n.  Define 


Fn(x) 


1,  x  e  S„, 
0,    x  g  Sn . 
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If  x  is  irrational,  then  x  $  Sn  for  any  n,  so  Fn(x)  =  0,  n  >  1.  If  x  is  rational,  then  x  €  Sn 
and  Fn(x)  =  1  for  all  sufficiently  large  «.  Therefore, 


Figure  4.4.2 


Uniform  Convergence 

The  pointwise  limit  of  a  sequence  of  functions  may  differ  radically  from  the  functions  in 
the  sequence.  In  Example  4.4.1,  each  Fn  is  continuous  on  (— oo,  1],  but  F  is  not.  In 
Example  4.4.3,  the  graph  of  each  Fn  has  two  triangular  spikes  with  heights  that  tend  to 
oo  as  n  oo,  while  the  graph  of  F  (the  x-axis)  has  none.  In  Example  4.4.4,  each  Fn 
is  integrable,  while  F  is  nonintegrable  on  every  finite  interval.  (Exercise  4.4.3).  There  is 
nothing  in  Definition  4.4.1  to  preclude  these  apparent  anomalies;  although  the  definition 
implies  that  for  each  xq  in  S,  Fn(xo)  approximates  F(xo)  if  n  is  sufficiently  large,  it 
does  not  imply  that  any  particular  Fn  approximates  F  well  over  all  of  S .  To  formulate  a 
definition  that  does,  it  is  convenient  to  introduce  the  notation 

\\g\\s  =  sup  \g(x)\ 
xeS 

and  to  state  the  following  lemma.  We  leave  the  proof  to  you  (Exercise  4.4.4). 
Lemma  4.4.2  If  g  and  h  are  defined  on  S,  then 

\\g  +  Hs  <  \\g\\s  +  \\h\\s 

and 

\\gh\\s  <  \\g\\s\\h\\s- 
Moroever,  if  either  g  or  h  is  bounded  on  S,  then 

\\g-h\\s  >  |||g||s  -  \\h\\s\\\- 
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Definition  4.4.3  A  sequence  {Fn}  of  functions  defined  on  a  set  S  converges  uniformly 
to  the  limit  function  F  on  S  if 

lim  \\Fn-F\\s=0. 

n—*oo 

Thus,  {Fn}  converges  uniformly  to  F  on  5  if  for  each  e  >  0  there  is  an  integer  N  such  that 

\\F„-F\\s<e    if    n>N.  (4.4.1) 


If  S  =  [a,b]  and  F  is  the  function  with  graph  shown  in  Figure  4.4.3,  then  (4.4.1)  implies 
that  the  graph  of 

y  =  Fn{x),    a  <  x  <  b, 

lies  in  the  shaded  band 


F(x)  —  e  <  y  <  F(x)  +  e,    a  <  x  <  b, 


\fn>N. 

From  Definition  4.4.3,  if  {Fn}  converges  uniformly  on  S,  then  {Fn}  converges  uniformly 
on  any  subset  of  S  (Exercise  4.4.6). 


y 


y=F(x)+e 

y=F(x) 

y=F(x)-( 


Figure  4.4.3 


Example  4.4.5  The  sequence  {F„}  defined  by 

F„(x)  =  x"e-"\    n  >  1, 

converges  uniformly  to  F  =  0  (that  is,  to  the  identically  zero  function)  on  5  =  [0,  oo), 
since  we  saw  in  Example  4.4.2  that 
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so 


F„  -  F\\s  <  e 


if  n  >  —  log  e.  For  these  values  of  n,  the  graph  of 


y  =  Fn(x),     0  <  x  <  oo, 


lies  in  the  strip 


—e<y<e,    x  >  0 


(Figure  4.4.4). 


The  next  theorem  provides  alternative  definitions  of  pointwise  and  uniform  convergence. 
It  follows  immediately  from  Definitions  4.4. 1  and  4.4.3. 

Theorem  4.4.4  Let  {Fn}  be  defined  on  S.  Then 

(a)    {  Fn  }  converges  pointwise  to  F  on  S  if  and  only  if  there  is,  for  each  e  >  0  and  x  €  S, 
an  integer  N  (which  may  depend  on  x  as  well  as  e)  such  that 


(b)    {Fn}  converges  uniformly  to  F  on  S  if  and  only  if  there  is  for  each  e  >  0  an  integer 
N  (which  depends  only  on  e  and  not  on  any  particular  x  in  S)  such  that 


Fn(x)-F(x)\  <e    if  n>N. 


Fn(x)  —  F(x)\  <  e   for  all  x  in  S  ifn  >  N . 


y 


y  =  -e 


Figure  4.4.4 


The  next  theorem  follows  immediately  from  Theorem  4.4.4  and  Example  4.4.6. 
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Theorem  4.4.5  If{F„ }  converges  uniformly  to  F  on  S,  then  {F„}  converges  pointwise 
to  F  on  S.  The  converse  is  false;  that  is,  pointwise  convergence  does  not  imply  uniform 
convergence. 


Example  4.4.6  The  sequence  {Fn}  of  Example  4.4.3  converges  pointwise  to  F  =  0 
on  (—oo,  oo),  but  not  uniformly,  since 


\Fn      F  ||  (—00,00) 


-*G)-Kt) 


=  n, 


so 


lim  \\F„  -  F || (-oo  oo)  =  00. 

n— >oo 

However,  the  convergence  is  uniform  on 

Sp  =  (-00,  p]  U  [p,  00) 

for  any  p  >  0,  since 

2 

\\F„  -  F\\s0  =  0    if   n  >  -. 


Example  4.4.7  If  Fn(x)  =  x",  n  >  1,  then  {Fn}  converges  pointwise  on  5  =  [0, 1] 
to 

F(x)-^  X=h 
W_|0,  0<x<l. 

The  convergence  is  not  uniform  on  S.  To  see  this,  suppose  that  0  <  e  <  1.  Then 
\FH(x)-F(x)\  >  1-e    if  <x  <  1. 

Therefore, 

l-e<  \\Fn-F\\s  <  1 
for  all  n  >  1 .  Since  e  can  be  arbitrarily  small,  it  follows  that 

II  ^--F  lis  =  1 

for  all  n  >  1 . 

However,  the  convergence  is  uniform  on  [0,  p]  if  0  <  p  <  1,  since  then 

\\F„  -  F\\[0iP]  =  p" 

and  limn-,-00  pn  =  0.  Another  way  to  say  the  same  thing:  {Fn}  converges  uniformly  on 
every  closed  subset  of  [0,  1).  ■ 


The  next  theorem  enables  us  to  test  a  sequence  for  uniform  convergence  without  guessing 
what  the  limit  function  might  be.  It  is  analogous  to  Cauchy's  convergence  criterion  for 
sequences  of  constants  (Theorem  4.1.13). 
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Theorem  4.4.6  (Cauchy's  Uniform  Convergence  Criterion)  A  sequence 
of  functions  {Fn}  converges  uniformly  on  a  set  S  if  and  only  if  for  each  e  >  0  there  is  an 
integer  N  such  that 

\\F„-Fm\\s  <€    if   n,m>N.  (4.4.2) 

Proof   For  necessity,  suppose  that  {Fn  }  converges  uniformly  to  F  on  S.  Then,  if  e  >  0, 
there  is  an  integer  N  such  that 

\\Fk-F\\s<€-    if  k>N. 

Therefore, 

\\F„-Fm\\s=  \\{Fn-  F)+(F  -  Fm)\\s 

<\\F„-F\\S  +  \\F -Fm\\s    (Lemma  4.4.2) 

€  € 

<  — I —  =  €    if    m.n  >  N. 
2  2 

For  sufficiency,  we  first  observe  that  (4.4.2)  implies  that 

\Fn(x)  -  Fm(x)\  <  €    if  n,m>N, 

for  any  fixed  x  in  S.  Therefore,  Cauchy's  convergence  criterion  for  sequences  of  constants 
(Theorem  4.1.13)  implies  that  {Fn(x)}  converges  for  each  x  in  S;  that  is,  {Fn}  converges 
pointwise  to  a  limit  function  F  on  S .  To  see  that  the  convergence  is  uniform,  we  write 

\Fm(x)  -  F{x)\  =  \[Fm(x)  -  F„(x)]  +  [Fn(x)  -  F(x)]\ 

<  \Fm{x)  -  Fn(x)\  +  \Fn(x)  -  F(x)\ 

<  \\Fm-  Fn\\s  +  \Fn(x)-  F(x)\. 

This  and  (4.4.2)  imply  that 

\Fm(x)  -  F(x)\  <  e  +  \Fn(x)  -  F(x)\    if    n,m>N.  (4.4.3) 
Since  lim,,.^  Fn(x)  =  F(x), 

\Fn(x)-F(x)\  <€ 

for  some  n  >  N,  so  (4.4.3)  implies  that 

\Fm(x)-F(x)\  <2e    if   m  >  N. 

But  this  inequality  holds  for  all  x  in  S,  so 

ll^m-/r||s<2e    if   m  >  N. 

Since  e  is  an  arbitrary  positive  number,  this  implies  that  {Fn}  converges  uniformly  to  F 
on  5.  H 

The  next  example  is  similar  to  Example  4.1.14. 
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Example  4.4.8  Suppose  that  g  is  differentiable  on  S  =  (—00,  00)  and 

\g'(x)\  <  r  <  1,    -00  <  x  <  00.  (4.4.4) 

Let  Fq  be  bounded  on  S  and  define 

F„(jc)  =g(Fn-1(x)),     n>\.  (4.4.5) 

We  will  show  that  { Fn }  converges  uniformly  on  S .  We  first  note  that  if  u  and  v  are  any  two 
real  numbers,  then  (4.4.4)  and  the  mean  value  theorem  imply  that 

\g(u)-g(v)\  <  r\u-v\.  (4.4.6) 

Recalling  (4.4.5)  and  applying  this  inequality  with  u  =  Fn-\ (x)  and  v  =  0  shows  that 

\Fn(x)\  =  \g(0)  +  (g(FB_x(jc))  -g(0))|  <  |g(0)|  +  |g(F»-iW)-?(0)| 
<  |g(0)|  +r|F„_i(ac)|; 

therefore,  since  fo  is  bounded  on  5,  it  follows  by  induction  that  Fn  is  bounded  on  S  for 
n  >  1.  Moreover,  if  n  >  1,  then  (4.4.5)  and  (4.4.6)  with  u  =  F„(x)  and  v  =  F„-i(x) 
imply  that 

\F„+i(x)-  F„(x)\  =  \g(Fn(x))-g(Fn-1(x))\  <  r\Fn(x)-Fn-1(x)\,  -00  <  x  <  00, 
so 

\\Fn+i  -  F„\\s  <  r||F„  -  Fn-i\\s. 
By  induction,  this  implies  that 

\\Fn+l-Fn\\s<rn\\Fl-F0\\s.  (4.4.7) 

If  n  >  m,  then 

l-fji  —  Fm\\s  =  \\(F„  —  Fn-i)  +  (Fn-i  —  Fn-2)  +  h  (Fm+\  —  Fm)\\s 

<  \\F„  —  Fn-i\\s  +  \\Fn-i  —  F„_2||s  +  •■•  +  ll^m+i  —  Fm\\s, 

from  Lemma  4.4.2.  Now  (4.4.7)  implies  that 

II F„  -  Fm\\s  <  ||F,  -  F0\\s(l  +  r  +  r2  +  ■  ■  ■  +  r"-m-l)rm 

<    H^i -F0II5-  • 

1  —  r 

Therefore,  if 

„  rN 

\\F\  —  -Foils-         <  e, 

1  —  r 

then  \\Fn  —  Fm\\s  <  e  if  n,  m  >  N .  Therefore,  {Fn}  converges  uniformly  on  S,  by 
Theorem  4.4.6. 
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Properties  Preserved  by  Uniform  Convergence 

We  now  study  properties  of  the  functions  of  a  uniformly  convergent  sequence  that  are 
inherited  by  the  limit  function.  We  first  consider  continuity. 

Theorem  4.4.7  If{F„}  converges  uniformly  to  F  on  S  and  each  Fn  is  continuous  at 
a  point  Xq  in  S,  then  so  is  F .  Similar  statements  hold  for  continuity  from  the  right  and  left. 

Proof    Suppose  that  each  Fn  is  continuous  at  xo-  If  x  €  5  and  n  >  1,  then 

\F(x)  -  F(x0)\  <  \F(x)  -  Fn(x)\  +  \Fn(x)  -  Fn(x0)\  +  \Fn(x0)  -  F(x0)\ 

<  \Fn(x)-Fn(x0)\+2\\Fn-F\\s.  k  •  -6> 

Suppose  that  e  >  0.  Since  {Fn}  converges  uniformly  to  F  on  S,  we  can  choose  n  so  that 
\\Fn  ~  F\\s  <  €.  For  this  fixed  n,  (4.4.8)  implies  that 

\F(x)-F(x0)\<\Fn(x)-Fn(x0)\+2e,    x  e  5.  (4.4.9) 

Since  Fn  is  continuous  at  xo,  there  is  a  S  >  0  such  that 

\F„(x)  -  F„(x0)\  <  e    if  \x-xo\<8, 

so,  from  (4.4.9), 

\F(x)-F(x0)\<3e,    if  \x-x0\<8. 

Therefore,  F  is  continuous  at  xq.  Similar  arguments  apply  to  the  assertions  on  continuity 
from  the  right  and  left.  TJ 

Corollary  4.4.8  If{Fn}  converges  uniformly  to  F  on  S  and  each  F„  is  continuous  on 
S,  then  so  is  F;  that  is,  a  uniform  limit  of  continuous  functions  is  continuous. 

Now  we  consider  the  question  of  integrability  of  the  uniform  limit  of  integrable  func- 
tions. 

Theorem  4.4.9  Suppose  that  {Fn}  converges  uniformly  to  F  on  S  =  [a,b].  Assume 
that  F  and  all  F„  are  integrable  on  [a,b].  Then 

pb  pb 

/    F(x)dx=  lim  /    Fn(x)dx.  (4.4.10) 

J  a  J  a 

Proof  Since 

pb  pb  pb 

I    Fn(x)dx—  I    F(x)dx  <  I    \Fn(x)  —  F(x)\  dx 

J  a  J  a  J  a 

<(b-a)\\F„-F\\s 

and  lim„_»oo  II  Fn  —  F \\ g  =  0,  the  conclusion  follows.  H 
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In  particular,  this  theorem  implies  that  (4.4.10)  holds  if  each  Fn  is  continuous  on  [a,  b], 
because  then  F  is  continuous  (Corollary  4.4.8)  and  therefore  integrable  on  [a,  b]. 

The  hypotheses  of  Theorem  4.4.9  are  stronger  than  necessary.  We  state  the  next  theorem 
so  that  you  will  be  better  informed  on  this  subject.  We  omit  the  proof,  which  is  inaccessible 
if  you  skipped  Section  3.5,  and  quite  involved  in  any  case. 

Theorem  4.4.10  Suppose  that  {F„}  converges  pointwise  to  F  and  each  F„  is  inte- 
grable on  [a,  b]. 

(a)  If  the  convergence  is  uniform,  then  F  is  integrable  on  [a,  b]  and  (4.4.10)  holds. 

(b)  If  the  sequence  {||^nl|[a,6]}  is  bounded  and  F  is  integrable  on  [a,  b],  then  (4.4.10) 
holds. 

Part  (a)  of  this  theorem  shows  that  it  is  not  necessary  to  assume  in  Theorem  4.4.9  that  F 
is  integrable  on  [a,b],  since  this  follows  from  the  uniform  convergence.  Part  (b)  is  known 
as  the  bounded  convergence  theorem.  Neither  of  the  assumptions  of  (b)  can  be  omitted. 
Thus,  in  Example  4.4.3,  where  {||F„  ||[o,i]}  is  unbounded  while  F  is  integrable  on  [0,  1], 

f  Fn(x)dx  =  1,    n  >  1,    but     f  F(x)dx  =  0. 
Jo  Jo 

In  Example  4.4.4,  where  \\Fn  \\[a,b]  =  1  for  every  finite  interval  [a,  b],  Fn  is  integrable  for 
all  7i  >  1,  and  F  is  nonintegrable  on  every  interval  (Exercise  4.4.3). 

After  Theorems  4.4.7  and  4.4.9,  it  may  seem  reasonable  to  expect  that  if  a  sequence  {Fn } 
of  differentiable  functions  converges  uniformly  to  F  on  S,  then  F'  =  limn^oo  F'n  on  S. 
The  next  example  shows  that  this  is  not  true  in  general. 

Example  4.4.9  The  sequence  {Fn}  defined  by 

F„(x)  =  x"  sin— I— 


converges  uniformly  to  F  =  0  on  [r\ ,  r-^\  if  0  <  T\  <  r-i  <  1  (or,  equivalently,  on  every 
compact  subset  of  (0,  1)).  However, 

F'(x)  =  nxn~l  sin  ; —  (n  —  1)  cos  r, 

xn  xn 

so  {F„(x)}  does  not  converge  for  any  x  in  (0,  1). 

Theorem  4.4.11  Suppose  that  F'n  is  continuous  on  [a,b]  for  all  n  >  1  and  {F„} 
converges  uniformly  on  [a,  b].  Suppose  also  that  {Fn(xo)}  converges  for  some  Xq  in  [a,b\. 
Then  {F„}  converges  uniformly  on  [a,  b]  to  a  differentiable  limit  function  F,  and 

F'(x)  =  lim  F'(x),    a  <x  <  b,  (4.4.11) 

n— »oo 

while 

F'+(a)  =  lim  F^(a+)    and    F'_(b)  =  lim  F„'(/3-).  (4.4.12) 
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Proof    Since  F'n  is  continuous  on  [a,  b],  we  can  write 

Fn(x)  =  Fn(x0)  +  [   F^t)  dt,  a<x< 
(Theorem  3.3.12).  Now  let 


(4.4.13) 


and 


L  =  lim  Fn(x0) 


G(x)  =  lim  F^x). 


(4.4.14) 


Since  F'n  is  continuous  and  {F„}  converges  uniformly  to  G  on  [a,b],  G  is  continuous  on 
[a,  b]  (Corollary  4.4.8);  therefore,  (4.4.13)  and  Theorem  4.4.9  (with  F  and  Fn  replaced  by 
G  and  F^)  imply  that  {Fn }  converges  pointwise  on  [a,  b]  to  the  limit  function 


F(x)  =  L  +  I  G{t)dt. 


f 

J  xa 


(4.4.15) 


The  convergence  is  actually  uniform  on  [a,b],  since  subtracting  (4.4.13)  from  (4.4.15) 
yields 

\F(x)  -  Fn(x)\  <\L-  Fn(x0)\  +  I  /  \G(t)  -  F^(t)\ dt 
<  \L-  F„(x0)\  +  \x-x0\  \\G  -  F^\\[aib], 


\\F-Fn\\[aM  <  \L-Fn(x0)\+{b-a)\\G-F^\\[aM, 

where  the  right  side  approaches  zero  as  n  oo. 

Since  G  is  continuous  on  [a,  b],  (4.4.14),  (4.4.15),  Definition  2.3.6,  and  Theorem  3.3.1 1 
imply  (4.4.11)  and  (4.4.12).  TJ 


Infinite  Series  of  Functions 

In  Section  4.3  we  defined  the  sum  of  an  infinite  series  of  constants  as  the  limit  of  the 
sequence  of  partial  sums.  The  same  definition  can  be  applied  to  series  of  functions,  as 
follows. 


Definition  4.4.12  If  {/_/ }|°  is  a  sequence  of  real-valued  functions  defined  on  a  set  D 
of  reals,  then  J] °°=k  f i  ^s  an  infinite  series  (or  simply  a  series)  of  functions  on  D .  The 
partial  sums  of,  Yl^Lk  fj  ^  defined  by 

ii 

Fn  =  J2fj'  n^L 

If  {Fn}™  converges  pointwise  to  a  function  F  on  a  subset  5  of  D,  we  say  that  Y^f=k  fj 
converges  pointwise  to  the  sum  F  on  S,  and  write 

oo 

F=J2fh  "5. 
j=k 
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If  {Fn}  converges  uniformly  to  F  on  5,  we  say  that  YlfLk  fj  converges  uniformly  to  F 
on  S . 

Example  4.4.10  The  functions 

fj(x)  =  x'\    j  >  0. 

define  the  infinite  series 

oo 

on  D  =  (— oo,  oo).  The  «th  partial  sum  of  the  series  is 

F„(x)  =  1  +  x  +  x2  +  ■■■  +  x", 

or,  in  closed  form, 

(l-xn+1 

Fn(x)=\     l-x  ' 

[n  +  1,        x  =  1 

(Example  4.1.11).  We  have  seen  earlier  that  {Fn }  converges  pointwise  to 

F{x)  =  ' 


1  -x 

if  |  x  |  <  1  and  diverges  if  |  x  \  >  1 ;  hence,  we  write 


oo  ^ 

Lx'  =   .      —  1  <  X  <  1. 
1  —  v 


l-x 

1=0 


Since  the  difference 

F(x)-F„(x)  = 


1  -x 

can  be  made  arbitrarily  large  by  taking  x  close  to  1, 

\\F  -  F„||(_w)  =  oo, 

so  the  convergence  is  not  uniform  on  (—1,  1).  Neither  is  it  uniform  on  any  interval  (—1,  r] 
with  —  1  <  r  <  1 ,  since 

ll^-^lk-l,,)^ 

for  every  n  on  every  such  interval.  (Why?)  The  series  does  converge  uniformly  on  any 
interval  [—  r,  r]  with  0  <  r  <  1,  since 

rn+l 

\\F      Fn  II  [— r,r] 


1  -r 

and  lim„^oo  r"  =  0.  Put  another  way,  the  series  converges  uniformly  on  closed  subsets  of 
(-1,1).  ■ 


246    Chapter  4  Infinite  Sequences  and  Series 


As  for  series  of  constants,  the  convergence,  pointwise  or  uniform,  of  a  series  of  functions 
is  not  changed  by  altering  or  omitting  finitely  many  terms.  This  justifies  adopting  the 
convention  that  we  used  for  series  of  constants:  when  we  are  interested  only  in  whether  a 
series  of  functions  converges,  and  not  in  its  sum,  we  will  omit  the  limits  on  the  summation 
sign  and  write  simply  ^  /„ . 

Tests  for  Uniform  Convergence  of  Series 

Theorem  4.4.6  is  easily  converted  to  a  theorem  on  uniform  convergence  of  series,  as  fol- 
lows. 

Theorem  4.4.13  (Cauchy's  Uniform  Convergence  Criterion)  A  series 

fn  converges  uniformly  on  a  set  S  if  and  only  if  for  each  e  >  0  there  is  an  integer  N 
such  that 

\\fn  +  /„+!+•■•  +  fm  lis  <  c    if  m>n>N.  (4.4.16) 
Proof   Apply  Theorem  4.4.6  to  the  partial  sums  of  ^  /„ ,  observing  that 

fn  +  fn  +  1  +  ■■■  +  fm  =  Fm  —  Fn-\ .  J] 

Setting  m  =  n  in  (4.4.16)  yields  the  following  necessary,  but  not  sufficient,  condition 
for  uniform  convergence  of  series.  It  is  analogous  to  Corollary  4.3.6. 

Corollary  4.4.14  If^fn  converges  uniformly  on  S,  then  limn-^oo  ||/«||s  =  0. 

Theorem  4.4.13  leads  immediately  to  the  following  important  test  for  uniform  conver- 
gence of  series. 

Theorem  4.4.15  (Weierstrass's  Test)  The  series  J2fn  converges  uniformly 
on  S  if 

\\fn\\sSMn,    n>k,  (4.4.17) 

where  ^  Mn  <  oo. 

Proof  From  Cauchy's  convergence  criterion  for  series  of  constants,  there  is  for  each 
e  >  0  an  integer  ./V  such  that 

Mn  +  Mn+i  -\  h  Mm  <  e    if    m  >n  >  N, 

which,  because  of  (4.4.17),  implies  that 

WfnWs  +  \\fn+i\\s  +  ■■■  +  WfmWs  <  £    if  m,n>N. 
Lemma  4.4.2  and  Theorem  4.4.13  imply  that     /„  converges  uniformly  on  S.  TJ 
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Example  4.4.11  Taking  Mn  =  l/n2  and  recalling  that 


<  00, 

nz 


we  see  that 

L—  T  and 

converge  uniformly  on  (—00,  00). 

Example  4.4.12  The  series 


X>m  =  £(t^)" 


converges  uniformly  on  any  set  S  such  that 

<  r  <  1,    leS,  (4.4.18) 


1  +  x 

because  if  S  is  such  a  set,  then 

ll/nlls<r" 

and  Weierstrass's  test  applies,  with 

M„  =       rn  <  00. 

Since  (4.4.18)  is  equivalent  to 

— r  r 

<X<   ,      X  €  S, 


1  +  r  1  —  r 

this  means  that  the  series  converges  uniformly  on  any  compact  subset  of  (—1/2,  00). 
(Why?)  From  Corollary  4.4.14,  the  series  does  not  converge  uniformly  on  S  =  (—1/2,  b) 
withfe  <  00  or  on  5  =  [a,  00)  with  a  >  —1/2,  because  in  these  cases  ||/n||s  =  1  for  all 
n.  ■ 

Weierstrass's  test  is  very  important,  but  applicable  only  to  series  that  actually  exhibit  a 
stronger  kind  of  convergence  than  we  have  considered  so  far.  We  say  that  /„  converges 
absolutely  on  S  if  £^  \f„  \  converges  pointwise  on  S,  and  absolutely  uniformly  on  S  if 

|/n  I  converges  uniformly  on  S.  We  leave  it  to  you  (Exercise  4.4.21)  to  verify  that  our 
proof  of  Weierstrass's  test  actually  shows  that  ^  /„  converges  absolutely  uniformly  on  S. 
We  also  leave  it  to  you  to  show  that  if  a  series  converges  absolutely  uniformly  on  S,  then  it 
converges  uniformly  on  5  (Exercise  4.4.20). 

The  next  theorem  applies  to  series  that  converge  uniformly,  but  perhaps  not  absolutely 
uniformly,  on  a  set  S . 
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Theorem  4.4.16  (Dirichlet's  Test  for  Uniform  Convergence)  The  se- 


ries 


n=k 

converges  uniformly  on  S  if  {fn}  converges  uniformly  to  zero  on  S,  $Z(/n+i  —  fn)  con- 
verges absolutely  uniformly  on  S,  and 

\\gk  +  gk+i+---+gn\\s  <M,    n>k,  (4.4.19) 

for  some  constant  M. 

Proof   The  proof  is  similar  to  the  proof  of  Theorem  4.3.20.  Let 

G„=  gk  +  gk+1  H  Vgn, 

and  consider  the  partial  sums  of  Y^n=k  fngn  '■ 

Hn  =  fkgk  +  fk  +  lgk  +  1  +■■■  +  fngn-  (4.4.20) 

By  substituting 

gk  =  Gk    and    g„  =  G„-G„-1,    n  >  k  +  1, 
into  (4.4.20),  we  obtain 

Hn  =  fkGk  +  fk+\(Gk+\  -  Gfc)  H  1-  fn(Gn  -  Gn-i), 

which  we  rewrite  as 

Hn  =  (fk  -  fk+\)Gk  +  (fk+l  -  fk+2)Gk+\  H  1"  (fn-l  ~  fn)Gn-\  +  fnG„, 

or 

Hn  =  /„_i  +  fnGn,  (4.4.21) 

where 

Jn-i  =  {fk  -  fk+i)Gk  +  (fk+i  ~  fk+2)Gk+i  +■■■  +  (fn-l  -  fn)Gn-i.  (4.4.22) 
That  is,  {/„}  is  the  sequence  of  partial  sums  of  the  series 


J2(fj-fj+i)Gj. 


(4.4.23) 


From  (4.4.19)  and  the  definition  of  G/, 


<Mj2\fj(x)-fj+1(x)\,     v  (-  S. 
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so 


J2(fj  -  fj+i)Gj 


<  M 


Now  suppose  that  e  >  0.  Since  5Z(/y  —  fj+i)  converges  absolutely  uniformly  on  5,  The- 
orem 4.4.13  implies  that  there  is  an  integer  N  such  that  the  right  side  of  the  last  inequality 
is  less  than  e  if  m  >  n  >  N .  The  same  is  then  true  of  the  left  side,  so  Theorem  4.4.13 
implies  that  (4.4.23)  converges  uniformly  on  S. 

We  have  now  shown  that  {/„}  as  defined  in  (4.4.22)  converges  uniformly  to  a  limit 
function  /  on  S.  Returning  to  (4.4.21),  we  see  that 

H„  —  J  =  J„-\  —  J  +  fnG„. 

Hence,  from  Lemma  4.4.2  and  (4.4.19), 

\\H„  —  J\\s  <  \\Jn-l  —  J\\s  +  ||/n||s||G„||s 

<  ||/„_i  -  J\\s  +  M\\f„\\s. 

Since  {Jn-\—  J}  and  {/„}  converge  uniformly  to  zero  on  S,  it  now  follows  that  liirin-xx,  \\H„- 
J\\s  =  0.  Therefore,  {Hn}  converges  uniformly  on  S.  H 

Corollary  4.4.17  The  series  J^^L^  fngn  converges  uniformly  on  S  if 

fn+i{x)  <  fnix),    X€S,  n>k, 
{  fn}  converges  uniformly  to  zero  on  S,  and 

\\gk  +  gk+\-\  r  gn\\s  <  M,  n>k, 

for  some  constant  M. 

The  proof  is  similar  to  that  of  Corollary  4.3.21.  We  leave  it  to  you  (Exercise  4.4.22). 


smnx 


Example  4.4.13  Consider  the  series 

00 

E 

n=l 

with  /„  =  l/n  (constant),  gn(x)  =  smnx,  and 

Gn(x)  =  sin  x  +  sin  2x  +  ■  ■  ■  +  sinnx. 
We  saw  in  Example  4.3.21  that 

1 


\G„(x)\  < 


sin(x/2)| 


n  >  1,    n  ^  2kn       (k  =  integer). 
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Therefore,  { ||  G„  ||  5 }  is  bounded,  and  the  series  converges  uniformly  on  any  set  S  on  which 
sinx/2  is  bounded  away  from  zero.  For  example,  if  0  <  S  <  n,  then 

8 

>  sin  - 
2 

if  x  is  at  least  S  away  from  any  multiple  of  2n ;  hence,  the  series  converges  uniformly  on 

00 

S  =   [J  [2kn  +  8,2(k  +  l)n  -8]. 

Since 


k =—00 


E 


smnx 


00,    x  ^  kit 


(Exercise  4.3.32(b)),  this  result  cannot  be  obtained  from  Weierstrass's  test. 
Example  4.4.14  The  series 


E 


(-1)" 

n  +  x2 


satisfies  the  hypotheses  of  Corollary  4.4.17  on  (—00,  00),  with 
1 


/»(*)  = 


n  +  x2 ' 


gn  =  (-1)", 


T2m 


0. 


and 


'2m  +  l 


Therefore,  the  series  converges  uniformly  on  (—00,  00).  This  result  cannot  be  obtained  by 
Weierstrass's  test,  since 

r—  = 

t—'n+x2 


00 


for  all  x . 


Continuity,  Differentiability,  and  Integrability  of  Series 

We  can  obtain  results  on  the  continuity,  differentiability,  and  integrability  of  infinite  series 
by  applying  Theorems  4.4.7,  4.4.9,  and  4.4.11  to  their  partial  sums.  We  will  state  the 
theorems  and  give  some  examples,  leaving  the  proofs  to  you. 

Theorem  4.4.7  implies  the  following  theorem  (Exercise  4.4.23). 

Theorem  4.4.18  IfYlnLk  f»  converges  uniformly  to  F  on  S  and  each  fn  is  contin- 
uous at  a  point  Xq  in  S,  then  so  is  F.  Similar  statements  hold  for  continuity  from  the  right 
and  left. 

Example  4.4.15  In  Example  4.4.12  we  saw  that  the  series 


F(x)  = 
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converges  uniformly  on  every  compact  subset  of  (—1/2,  oo).  Since  the  terms  of  the  series 
are  continuous  on  every  such  subset,  Theorem  4.4.4  implies  that  F  is  also.  In  fact,  we  can 
state  a  stronger  result:  F  is  continuous  on  (—1/2,  oo),  since  every  point  in  (—1/2,  oo)  lies 
in  a  compact  subinterval  of  (—1/2,  oo). 

The  same  argument  and  the  results  of  Example  4.4.13  show  that  the  function 


G(x)  =  J2 


smnx 


n 

n  =  \ 


is  continuous  except  perhaps  at  x^  =  2kir  (k  =  integer) 
From  Example  A  AAA,  the  function 

oo 

H{x)  =  Y(-Vf  — 

— '  n  4- 


n  +  x2 

n  =  l 


is  continuous  for  all  x. 


The  next  theorem  gives  conditions  that  permit  the  interchange  of  summation  and  inte- 
gration of  infinite  series.  It  follows  from  Theorem  4.4.9  (Exercise  4.4.25).  We  leave  it  to 
you  to  formulate  an  analog  of  Theorem  4.4. 10  for  series  (Exercise  4.4.26). 

Theorem  4.4.19  Suppose  that  "Y^L^  fn  converges  uniformly  to  F  on  S  =  [a,b]. 
Assume  that  F  and  f„,  n  >  k,  are  integrable  on  [a,b\.  Then 

I    F(x)dx  =        /  fn(x)dx. 

J  a  ,  J  a 

n=k 

We  say  in  this  case  that  Yl™=k  fn  can  be  integrated  term  by  term  over  [a,  b\. 
Example  4.4.16  From  Example  4.4.10, 

.  oo 

J_  =  yxnj  _1 

1  -x 


oo 

<  X  <  1. 

«=0 

The  series  converges  uniformly,  and  the  limit  function  is  integrable  on  any  closed  subinter- 
val [a,  b]  of  (—  1, 1);  hence, 

dx        °°  '* 


dx, 

n=0  '  " 

SO 


Ja    1      x  „-nJa 


x    ' «  +  l  _  an+1 


log(l-a)-log(l-6)  =  V 

„ =o      n  +  1 


Letting  a  =  0  and  b  =  x  yields 


log(l  -x)  =  -  V]  — — -,  -1<X<1. 

a  n  +  1 
n=0 
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The  next  theorem  gives  conditions  that  permit  the  interchange  of  summation  and  differ- 
entiation of  infinite  series.  It  follows  from  Theorem  4.4.1 1  (Exercise  4.4.28). 

Theorem  4.4.20  Suppose  that  fn  is  continuously  differentiable  on  [a,  b]  for  each  n  > 
k,  ^J^Lk  fn(xo)  converges  for  some  Xq  in  [a,  b],  and  Yln=k  fn  converges  uniformly  on 
[a,b].  Then  Yln=k  f»  converges  uniformly  on  [a,  b]  to  a  differentiable  function  F,  and 


F'(x)  =  J2fn(x)>  Cl<X<b, 


n=k 


while 


F'(a+)  =  £ />+)    and    F'(b-)  =  £  /„>-)■ 

n=k  n=k 

We  say  in  this  case  that  ^^Lj.  fn  can  be  differentiated  term  by  term  on  [a,  b].  To  apply 
Theorem  4.4.20,  we  first  verify  that  Yln^k  fn  (xo)  converges  for  some  .To  in  [a,  b]  and  then 
differentiate  Yln^k  fn  term  by  term.  If  the  resulting  series  converges  uniformly,  then  term 
by  term  differentiation  was  legitimate. 

Example  4.4.17  The  series 

oo 

Ec-1)"- 

'         n  n 

n  =  l 

converges  at  Xq  =  0.  Differentiating  term  by  term  yields  the  series 

oo  I  X 

y(-l)"+1  — sin-  (4.4.25) 
* — '  n1  n 

n  =  \ 

of  continuous  functions.  This  series  converges  uniformly  on  (— oo,  oo),  by  Weierstrass's 
test.  By  Theorem  4.4.20,  the  series  (4.4.24)  converges  uniformly  on  every  finite  interval  to 
the  differentiable  function 


cos  -  (4.4.24) 


oo  - 

F(x)  =  £(-!)"- cos 

* — '  n  n 


-OO  <  X  <  oo, 


and 


oo 

F'(*)  =  I>l)"+1-2 


x 

sin—,     —  oo  <  x  <  oo. 
n 

n  =  \ 


Example  4.4.18  The  series 

c 


oo  „ 

x" 


n\  2!  3! 

n=0 
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converges  uniformly  on  every  interval  [— r,  r]  by  Weierstrass's  test,  because 


— j-^-r,  \x\<r, 


and 


T- 


r" 

<  oo 


for  all  r,  by  the  ratio  test.  Differentiating  the  right  side  of  (4.4.26)  term  by  term  yields  the 
series 


T— —  =  T 

^  (n  -  1)!      ^  n\ 

n  =  l  v  ' 


n=0 

which  is  the  same  as  (4.4.26).  Therefore,  the  differentiated  series  is  also  uniformly  conver- 
gent on  [—  r,  r]  for  every  r,  so  the  term  by  term  differentiation  is  legitimate  and 

E'{x)  =  E{x),     —oo  <  x  <  oo. 

This  is  not  surprising  if  you  recognize  that  E(x)  =  ex . 

Example  4.4.19  Failure  to  verify  that  the  given  series  converges  at  some  point  can 
lead  to  erroneous  conclusions.  For  example,  differentiating 

oo 

X 


V  cos  -  (4.4.27) 

n 

n  =  \ 

term  by  term  yields 

oo  , 

—  sin  — , 

n  n 

n  =  \ 

which  converges  uniformly  on  [— r,  r]  for  every  r,  since 


1  x 
—  sin  — 

n  n 


\x\ 

<  —    (Exercise  2.3.19) 
n1 

<  -4    if    \x\  <  r, 
n2- 


and  1/n2  <  oo.  We  cannot  conclude  from  this  that  (4.4.27)  converges  uniformly  on 
[—  r,  r\.  In  fact,  it  diverges  for  every  x.  (Why?) 


4.4  Exercises 


1.    Find  the  set  S  on  which  {Fn }  converges  pointwise,  and  find  the  limit  function, 
(a)  Fn(x)  =  xn(l-x2)  (b)  Fn(x)  =  nxn(l-x2) 
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(c)  Fn(x)  =  xn(l-xn) 
1  +  x" 


(e)  Fn{x)  = 
(g)  Fn(x 
(i)  Fn(x) 


1  +x2n 
(g)  F„(*)  =  «2(l-cos^) 

(*  +  n)2 


(d)F„(x)  =  sinfl  +  iY 
(f )  F„(x)  =  x  sin  — 
(h)  F„(x)  =  «jce""x2 


jc2  +  n2 

2.  Prove:  If  {Fn}  converges  to  F  on  [a,  b]  and  F„  is  nondecreasing  for  each  n,  then  F 
is  nondecreasing. 

3.  Show  that  the  functions  {  F„ }  of  Example  4.4.4  are  integrable  and  F  =  linin^oo  Fn  (x) 
is  nonintegrable  on  every  finite  interval. 

4.  Prove  Lemma  4.4.2. 


Find  F(x)  =  lim„^oo  Fn(x)  on  5.  Show  that  {Fn}  converges  uniformly  to  F  on 
closed  subsets  of  S,  but  not  on  S. 

a)    Fn(x)  =  x"  sinnx,    S  =  (-1, 1) 


b)    Fn(x)  = 


1 


1  +  x1 


S  =  {x  |  x  ^  ±1} 


yi  sin  x 

c)    Fn(x)  =   r-,    S  =  (0,  oo)  Hint:  5ee  Exercise  2.3.19. 

1  + 

6.  (a)    Show  that  if  {Fn}  converges  uniformly  on  S,  then  {Fn}  converges  uniformly 

on  every  subset  of  5. 

(b)  Show  that  if  {Fn}  converges  uniformly  on  Si,  S2,  ■  ■     Sm,  then  {Fn}  con- 
verges uniformly  on  U/fc=i 

(c)  Give  an  example  where  {Fn}  converges  uniformly  on  each  of  an  infinite  se- 
quence of  sets  Si,  S2,       but  not  on  (JfcLi  ^A:- 

7.  Describe  the  sets  on  which  the  sequences  of  Exercise  4.4.1  converge  uniformly.  Re- 
strict your  attention  to  sets  that  are  the  union  of  finitely  many  intervals  and  singleton 
sets. 

8.  Suppose  that  {Fn}  converges  pointwise  on  [a,  b]  and,  for  each  x  in  [a,  b],  there  is 
an  open  interval  Ix  containing  x  such  that  {Fn }  converges  uniformly  on  Ix  PI  [a,  b]. 
Show  that  {Fn}  converges  uniformly  on  [a,  b]. 

9.  Prove:  If  {Fn}  converges  uniformly  to  F  on  S ,  then  lim^—^oo  ||  Fn  ||  s  —  ||  ^||  s  • 

10.  Prove:  If  {Fn}  converges  uniformly  to  F  on  S,  then  F  is  bounded  on  5  if  and  only 
if  linWoolUFnlls}  <  00. 

11.  Prove:  If  {Fn}  and  {Gn}  converge  uniformly  to  F  and  G  on  5,  then  {Fn  +  Gn} 
converges  uniformly  to  F  +  G  on  5 . 

12.  (a)   Prove:  If  {Fn}  and  {Gn}  converge  uniformly  to  bounded  functions  F  and  G 

on  S,  then  {FnGn}  converges  uniformly  to  FG  on  S. 
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(b)    Give  an  example  showing  that  the  conclusion  of  (a)  may  fail  to  hold  if  F  or 
G  is  unbounded  on  S. 

13.  (a)    Suppose  that  {Fn}  converges  uniformly  to  F  on  (a,  b).  Prove:  If  Xq  <  a  <  b 

and  Ln  =  limx^xo  Fn{x)  exists  (finite)  for  every  n,  then  L  =  limn-xx,  Ln 
exists  (finite)  and 

lim  F(x)  =  L, 

(b)  State  similar  results  for  limits  from  the  right  and  left. 

14.  Find  the  limits. 

/  \          f4  n      x                           z-i  \          f2  dx 
(a)  lim  /    —  sin—  dx  (b)  lim  /   — 

v    /H-*ooJ1    X        n  y    '  n~*ooJQ^  \  +  X2n 

(c)  lim  /    nxe~nx2  dx  (d)  lim  /    (l  +  -)  dx 

15.  Prove  (without  using  Theorem  4.4.10):  If  each  Fn  is  integrable  and  {Fn  }  converges 
uniformly  on  [a,  b],  then  lim„^oo  f  Fn(x)  dx  exists. 

16.  Prove  (without  using  Theorem  4.4.10):  If  each  Fn  is  nondecreasing  and  {Fn}  con- 
verges uniformly  to  F  on  [a,  b],  then 


pb  pb 

lim  /    Fn(x)dx  =  I  F(x)dx. 

n^°°  J  a  J  a 


17.  Use  Weierstrass's  test  to  determine  sets  on  which  the  series  converges  absolutely 
uniformly. 

(a)  £^(^b)  (b)£^(i^) 

(c)Ynxn{\-x)n  (d)Y  J-  

(e)       —  (f )       4  tt-  sintix 

18.  Show  that  if  ^  \an\  <  oo,  then  ^an  cosnx  and  ^an  sinnx  define  continuous 
functions  on  (— oo,  oo). 

19.  (a)   Give  an  example  showing  that  the  following  "comparison  test"  is  invalid:  If 

fn  converges  uniformly  on  5  and  ||g«||s  <  ||/n||s,  then  converges 
uniformly  on  S . 

(b)  This  "comparison  test"  can  be  corrected  by  adding  one  word  to  its  hypothesis 
and  conclusion.  What  is  the  word? 

20.  (a)   Explain  the  difference  between  the  following  statements:  (i)  ^  /„  converges 

absolutely  and  uniformly  on  S;  (ii)  fn  converges  absolutely  uniformly 
on  S. 
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(b)    Show  that  if  £  /„  converges  absolutely  uniformly  on  S,  then  ^  /„  converges 
uniformly  on  S . 

21.  Show  that  the  hypotheses  of  Weierstrass's  test  imply  that  ^  /„  converges  absolutely 
uniformly  on  S . 

22.  Prove  Corollary  4.4.17. 

23.  Prove  Theorem  4.4.18. 

24.  Suppose  that  {an}^°  is  monotonic  and  lim^-xx,  a„  =  0.  Show  that 

00  oo 

^^ansinnx    and  ^^ancosnx 

n  =  l  n  =  l 

define  functions  continuous  for  all  x  ^  2kn  (k  =  integer). 

25.  Prove  Theorem  4.4.19. 

26.  Formulate  an  analog  of  Theorem  4.4. 10  for  series. 

27.  In  Section  4.5  we  will  see  that 

°°  r2n  °°  2«  +  l 

^  =£(-1)"^-    and    sin*  =  D-^TTXnT 

for  all  x,  and  in  both  cases  the  convergence  is  uniform  on  every  finite  interval.  Find 
series  that  converge  to 


(a)    F(x)  =  fX  e~'2  dt    and  (b)    G(x)  =  f  — 
Jo  Jo  t 


dl 


for  all  x . 

28.  Prove  Theorem  4.4.20. 

29.  Show  from  Example  4.4.17  that  $^n=i(— 1)"  sin(x/«)  converges  uniformly  on  any 
finite  interval. 

30.  Prove:  If  0  <  a„+\  <  a„  and  <  oo  for  some  positive  integer  k,  then 
XX- 1)"  sina„x  converges  uniformly  on  any  finite  interval. 

31.  Forn  >  2,  define 

n4(x  —  n  +  l/n3),  n  —  l/n3<x<n, 

fn(x)  =  <  —nA{x  —  n  —  l/n3),  n  <  x  <  n  +  l/n3, 

0,  \x-n\>  l/n3, 


and  let  F(x)  =  *}2™=2  fn(x).  Show  that  f0   F(x)  dx  <  oo,  and  conclude  that  ab- 
solute convergence  of  an  improper  integral       F(x)  dx  does  not  imply  that  liirin^oo  F{x)  = 
0,  even  if  F  is  continuous  on  [0,  oo). 
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4.5  POWER  SERIES 

We  now  consider  a  class  of  series  sufficiently  general  to  be  interesting,  but  sufficiently 
specialized  to  be  easily  understood. 

Definition  4.5.1  An  infinite  series  of  the  form 

oo 

J2an(x-x0)n,  (4.5.1) 

n=0 

where  xq  and  ao,a\,  . . .,  are  constants,  is  called  a  power  series  in  x  —  xq.  ■ 
The  following  theorem  summarizes  the  convergence  properties  of  power  series. 

Theorem  4.5.2  In  connection  with  the  power  series  (4.5.1),  define  R  in  the  extended 
reals  by 

i  =  IS  \an\lln.  (4.5.2) 

In  particular,  R  =  0  iflimn^.^  Iflnl1^"  =  oo,  and  R  =  oo  //lim„^oo  |a„  | =  0.  Then 
the  power  series  converges 

(a)  only  for  x  =  xq  if  R  =  0; 

(b)  for  all  x  if  R  =  oo,  and  absolutely  uniformly  in  every  bounded  set; 

(c)  for  x  in  (xo  —  R,  Xq  +  R)  ifO  <  R  <  oo,  and  absolutely  uniformly  in  every  closed 
subset  of  this  interval. 

The  series  diverges  if\x  —  x$\  >  R.  No  general  statement  can  be  made  concerning  conver- 
gence at  the  endpoints  x  =  Xq  +  R  and  x  =  Xq  —  R  :  the  series  may  converge  absolutely 
or  conditionally  at  both,  converge  conditionally  at  one  and  diverge  at  the  other,  or  diverge 
at  both. 

Proof   In  any  case,  the  series  (4.5.1)  converges  to  ao  if  x  =  xq.  If 

J2\an\rn<oo  (4.5.3) 

for  some  r  >  0,  then  ^  a„  (x  —  xq)"  converges  absolutely  uniformly  in  [^o  —  r,xo  + 
r],  by  Weierstrass's  test  (Theorem  4.4.15)  and  Exercise  4.4.21.  From  Cauchy's  root  test 
(Theorem  4.3.17),  (4.5.3)  holds  if 

iim  (\an\rn)1/n  <  1, 

n— >oo 

which  is  equivalent  to 

r  Urn  |a„|1/n  <  1 

n— too 

(Exercise  4.1.30(a)).  From  (4.5.2),  this  can  be  rewritten  as  r  <  R,  which  proves  the 
assertions  concerning  convergence  in  (b)  and  (c). 

If  0  <  R  <  oo  and  \x  —  xq\  >  R,  then 
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1  1 

R  >  |x-Xo| 


so  (4.5.2)  implies  that 


\a„\1/n  >  -.  r    and  therefore    \a„(x  -  xQ)n  |  >  1 

|x  -  x0| 

for  infinitely  many  values  of  n.  Therefore,  ^2an(x  —  xq)"  diverges  (Corollary  4.3.6)  if 
\x  —  xq\  >  R.  In  particular,  the  series  diverges  for  all  x  ^  xq  if  R  =  0. 

To  prove  the  assertions  concerning  the  possibilities  at  x  =  xq  +  R  and  x  =  xq  —  R 
requires  examples,  which  follow.  (Also,  see  Exercise  4.5.1.)  H 

The  number  R  defined  by  (4.5.2)  is  the  radius  of  convergence  of  *^an(x  —  xq)"  .  If 
R  >  0,  the  open  interval  (xq  —  R,Xq  +  R),  or  (— oo,  oo)  if  R  =  oo,  is  the  interval  of 
convergence  of  the  series.  Theorem  4.5.2  says  that  a  power  series  with  a  nonzero  radius 
of  convergence  converges  absolutely  uniformly  in  every  compact  subset  of  its  interval  of 
convergence  and  diverges  at  every  point  in  the  exterior  of  this  interval.  On  this  last  we  can 
make  a  stronger  statement:  Not  only  does  ^  an(x  —  xq)"  diverge  if  \x  —  Xq\  >  R,  but  the 
sequence  {an(x  —  xq)"}  is  unbounded  in  this  case  (Exercise  4.5.3(b)). 


Example  4.5.1  For  the  series 

sin  njt/6 


we  have 


2" 

l/n 


(x-iy, 


m\an\^=m(lsinn7t/6) 

n—>oo  n—>oo  \        2,n  J 


=  -  lim  (I  smnji/6\)1/n    (Exercise  4.1.30(a)) 

2  n^oo 

=  2-(i)4 

Therefore,  R  =  2  and  Theorem  4.5.2  implies  that  the  series  converges  absolutely  uniformly 
in  closed  subintervals  of  (—1,  3)  and  diverges  if  x  <  —  1  or  x  >  3.  Theorem  4.5.2  does  not 
tell  us  what  happens  when  x  =  —  1  or  x  =  3,  but  we  can  see  that  the  series  diverges  in  both 
these  cases  since  its  general  term  does  not  approach  zero. 

Example  4.5.2  For  the  series 

xn 


£ 


n 

l/n 


lim  \a„\1/n  =  lim  ( ' =  lim  exp  (-  log-)  =  e°  =  1. 

n^oo  «^oo  \  n )  «~>oo       yn       n  J 

Therefore,  R  =  1  and  the  series  converges  absolutely  uniformly  in  closed  subintervals 
of  (—1,  1)  and  diverges  if  \x\  >  1.  For  x  =  —  1  the  series  becomes  £(— l)B/«,  which 
converges  conditionally,  and  at  x  =  1  the  series  becomes  ^  l/n,  which  diverges.  ■ 
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The  next  theorem  provides  an  expression  for  R  that,  if  applicable,  is  usually  easier  to  use 
than  (4.5.2). 

Theorem  4.5.3  The  radius  of  convergence  of  ^a„{x  —  xq)"  is  given  by 


On  +  l 


1 

—  =  lim 

R  n— >oo 


if  the  limit  exists  in  the  extended  reals. 

Proof   From  Theorem  4.5.2,  it  suffices  to  show  that  if 


exists  in  the  extended  reals,  then 


L  =  lim 

n— >oo 


L  =  lim  \an 

n— >oo 


l/n 


(4.5.4) 


(4.5.5) 


We  will  show  that  this  is  so  if  0  <  L  <  oo  and  leave  the  cases  where  L  =  0  or  L  =  oo  to 
you  (Exercise  4.5.7). 

If  (4.5.4)  holds  with  0  <  L  <  oo  and  0  <  e  <  L,  there  is  an  integer  N  such  that 


L-e  < 


(I  i, 


<  L  +  e    if    m  >  N, 


so 

By  induction, 

Therefore,  if 
then 


\am\(L  -e)  <  \am+i\  <  \am\(L  +  e)    if   m  >  N. 


aN\(L-e)n~N  <  \a„\ <  \aN\(L  +  e)n~"    if    n  >  N. 


Kx  =  \aN\{L-  e)~N    and    K2  =  \aN\(L  +  e)~N , 


K11/n(L-€)<\an\l/n  <K'/n(L  +  €). 


l/n   ,  K\jn. 


(4.5.6) 


Since  lim„^oo  Klln  =  1  if  K  is  any  positive  number,  (4.5.6)  implies  that 


L-€<  lim  \a„\1/n  <  lim  \an\i/n  <L  +  e. 


l/n 


Since  e  is  an  arbitrary  positive  number,  it  follows  that 

lim  \an\1/n  =  L, 

n— >oo 


which  implies  (4.5.5). 


TJ 
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Example  4.5.3  For  the  power  series 


n 


lim 

n— >oo 


=  lim 


x 


=  lim 


1 


=  0. 


n^oo  (n  +  1)!     «^-oo  n  +  1 

Therefore,  R  =  oo;  that  is,  the  series  converges  for  all  x,  and  absolutely  uniformly  in  every 
bounded  set. 


Example  4.5.4  For  the  power  series 


lim 

W— >00 


(«  +  1)! 

=  lim   -       =  lim  (n  +  1)  =  oo. 


Therefore,  R  =  0,  and  the  series  converges  only  if  x  =  0. 

Example  4.5.5  Theorem  4.5.3  does  not  apply  directly  to 

(—l)n 

 x2n    (p  =  constant). 

4nnP 


(4.5.7) 


which  has  infinitely  many  zero  coefficients  (of  odd  powers  of  x).  However,  by  setting 
y  =  x2,  we  obtain  the  series 


which  has  nonzero  coefficients  for  which 


(4.5.8) 


lim 

n— >oo 


4"«p  1         /       IV  1 

lim  — —   =  -  lim  I  1  H —  I      =  -. 

n^oo  4n+1(n  +  l)P      4  «^oo  y       n  J  4 


Therefore,  (4.5.8)  converges  if  \y\  <  4  and  diverges  if  \y\  >  4.  Setting  y  =  x2,  we 
conclude  that  (4.5.7)  converges  if  \x\  <  2  and  diverges  if  |jc|  >  2.  At  x  =  ±2,  (4.5.7) 
becomes  ^2(—l)"/np,  which  diverges  if  p  <  0,  converges  conditionally  if  0  <  p  <  1,  and 
converges  absolutely  if  p  >  1 . 


Properties  of  Functions  Defined  by  Power  Series 

We  now  study  the  properties  of  functions  defined  by  power  series.  Henceforth,  we  consider 
only  power  series  with  nonzero  radii  of  convergence. 

Theorem  4.5.4  A  power  series 

oo 

/OO  =  ^a„(x  -x0)n 

n=0 
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with  positive  radius  of  convergence  R  is  continuous  and  differentiable  in  its  interval  of 
convergence,  and  its  derivative  can  be  obtained  by  differentiating  term  by  term;  that  is, 

oo 

fl(x)  =  Yjna„(x-x0)n-\  (4.5.9) 
n=l 

which  can  also  be  written  as 

oo 

f{x)  =  +  l)an+1(x-x0)".  (4.5.10) 

n=0 

This  series  also  has  radius  of  convergence  R. 

Proof  First,  the  series  in  (4.5.9)  and  (4.5.10)  are  the  same,  since  the  latter  is  obtained 
by  shifting  the  index  of  summation  in  the  former.  Since 

Urn  ((«  +  l)|<3„|)1/n  =  lhH  («  +  l)1/n\an\1/n 


(  lim  (n  +  l)1/n)  (  lim  |a„|1/n)     (Exercise  4.1.30(a)) 

VJ— i >oo  /  \n— >oo  / 

lim  expfl0g("+1))l  f  Urn  \an\^)  =  -  =  -, 
^oo       \       n        J  j  \«^oo  /       R  R 


the  radius  of  convergence  of  the  power  series  in  (4.5.10)  is  R  (Theorem  4.5.2).  Therefore, 
the  power  series  in  (4.5.10)  converges  uniformly  in  every  interval  [xq  —  r,x$  +  r]  such  that 
0  <  r  <  R,  and  Theorem  4.4.20  now  implies  (4.5.10)  for  all  x  in  (x0  -  R,x0  +  R).  TJ 

Theorem  4.5.4  can  be  strengthened  as  follows. 
Theorem  4.5.5  A  power  series 

oo 

fix)  =  ^an(x-xQ)n 

n=0 

with  positive  radius  of  convergence  R  has  derivatives  of  all  orders  in  its  interval  of  convergence, 
which  can  be  obtained  by  repeated  term  by  term  differentiation;  thus, 

oo 

f{k\x)  =  ^n(n-l)---(n-k  +  X)an{x-x0)n-k.  (4.5.11) 

n=k 

The  radius  of  convergence  of  each  of  these  series  is  R. 

Proof  The  proof  is  by  induction.  The  assertion  is  true  for  k  =  1,  by  Theorem  4.5.4. 
Suppose  that  it  is  true  for  some  k  >  1.  By  shifting  the  index  of  summation,  we  can  rewrite 
(4.5.11)  as 

OO 

f{k)(x)  =  J2(n+kXn+k-l)---(n+l)an+k(x-x0)n,     \x  -  x0\  <  R. 

n=0 
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b„  =  (n+k)(n  +k-l)---(n+  l)an+k,  (4.5.12) 

we  rewrite  this  as 

oo 

f«\x)  =  J2bn(x-x0)n,  \x-x0\<R. 

n=0 

By  Theorem  4.5.4,  we  can  differentiate  this  series  term  by  term  to  obtain 

oo 

f{k+l)(x)  =  J2nbn(x-x0)n-\     \x-xQ\  <  R. 
n=\ 

Substituting  from  (4.5.12)  for  b„  yields 

oo 

f{k+1)(x)  =  J2(n+k)(»  +k-l)---{n  +  l)nan+k(x  -  xQ)n~\     \x-xQ\  <  R. 
n=l 

Shifting  the  summation  index  yields 

oo 

f(k+1\x)=  n{n-l)---{n-k)an{x-x0)n-k-\  \x-x0\<R, 

n=k+\ 

which  is  (4.5.1 1)  with  k  replaced  by  k  +  1.  This  completes  the  induction.  TJ 
Example  4.5.6  In  Example  4.4.10  we  saw  that 


1  00 

— —  =  sTxn 

l-x 


x\  <  1. 

n=0 


Repeated  differentiation  yields 

.  |  oo 

 =  Yjn(n-\)---{n-k+  l)x» 


n=0 

SO 


Yu(n  +  k)(n  +  k-\)---{n  +  \)xn,  \x\<\, 

12  =  0 

l_  =  y(n  +  k\ 


\x\  <  1. 


(l-x) 

Example  4.5.7  By  the  method  of  Example  4.5.5,  it  can  be  shown  that  the  series 

00  2«  +  l  00  2n 

S(x)  =  Vf-iy   and     C(x)  =  Y(-l)n^— 

^         (2n  +  1)!  K  '     4i  (2n)\ 
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converge  for  all  x .  Differentiating  yields 

00  n 

=  £(-!)»  — -  =  C(x) 

and 

00  2n-l  00  2«  +  l 

c'(jc)  =  Y(-iy—  =  -  Y(-ir—  =  -s(x). 

v;     ^      '  (2re  - 1)!        ^      '  (2re  +  1)!  w 

«  =  1  v  7  n=0  v  ' 

These  results  should  not  surprise  you  if  you  recall  that 

S(x)  =  sinx    and    C(x)  =  cosx. 
(We  will  soon  prove  this.)  ■ 

Theorem  4.5.5  has  two  important  corollaries. 

Corollary  4.5.6  // 

00 

f(x)  =  ^2an(x-x0)n,  \x-x0\<R, 

n=0 

then 

an  =   :  • 

re! 

Proof    Setting  x  =  x0  in  (4.5.11)  yields 

f{k\x0)  =  k\ak.  e 

Corollary  4.5.7  (Uniqueness  of  Power  Series)  // 

00  00 

J2an(x-x0)"  =  J2bn(x-x0)n  (4.5.13) 

n=0  n=0 

for  all  x  in  some  interval  (xo  —  r,  Xo  +  r),  then 

an=bn,    n  >  0.  (4.5.14) 

Proof  Let 

00  00 
/(*)  =  X!  a»  (x  ~  x°)"    and    S(x)  =  Y1  bn  (x  -  x0)" . 

n=0  n=0 

From  Corollary  4.5.6, 

finHxo)        ,    ,  gW(x0) 

an  =    and    bn  =   .  (4.5.15) 

n !  n ! 
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From  (4.5.13),  /  =  g  in  (xq  —  r,  xq  +  r).  Therefore, 

f(n)(xo)=g(n\x0),  n>0. 

This  and  (4.5.15)  imply  (4.5.14).  TJ 

Theorems  4.4.19  and  4.5.2  imply  the  following  theorem.  We  leave  the  proof  to  you 
(Exercise  4.5.15). 

Theorem  4.5.8  If Xi  and  Xj  are  in  the  interval  of  convergence  of 

oo 

f(x)  =  ^2ia„(x  -Xo)n, 

n=0 

then 

oo 

f(x)  dx=T  -^-r  [(X2  -X0r  +  1-(Xl-  X0)"+1]  ■ 
„  =0  "  +  1 

that  is,  a  power  series  may  be  integrated  term  by  term  between  any  two  points  in  its  interval 
of  convergence. 

Example  4.5.16  presents  an  application  of  this  theorem. 


Taylor's  Series 

So  far  we  have  asked  for  what  values  of  x  a  given  power  series  converges,  and  what  are 
the  properties  of  its  sum.  Now  we  ask  a  related  question:  What  properties  guarantee  that  a 
given  function  /  can  be  represented  as  the  sum  of  a  convergent  power  series  in  x  —  xq!  A 
partial  answer  to  this  question  is  provided  by  what  we  already  know:  Theorem  4.5.5  tells  us 
that  /  must  have  derivatives  of  all  orders  in  some  neighborhood  of  xo,  and  Corollary  4.5.6 
tells  us  that  the  only  power  series  in  x  —  xq  that  can  possibly  converge  to  /  in  such  a 
neighborhood  is 

fM(x0), 


E 


n\ 


-(x-x0)n.  (4.5.16) 


This  is  called  the  Taylor  series  of  f  about  Xq  (also,  the  Maclaurin  series  of  /,  if  Xq  =  0). 
The  mth  partial  sum  of  (4.5.16)  is  the  Taylor  polynomial 

z— '  n\ 

n=0 

defined  in  Section  2.5. 

The  Taylor  series  of  an  infinitely  differentiable  function  /  may  converge  to  a  sum  dif- 
ferent from  / .  For  example,  the  function 
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is  infinitely  differentiable  on  (—00,  00)  and  =  0  for  n  >  0  (Exercise  2.5.1),  so  its 

Maclaurin  series  is  identically  zero. 

The  answer  to  our  question  is  provided  by  Taylor's  theorem  (Theorem  2.5.4),  which 
says  that  if  /  is  infinitely  differentiable  on  (a,  b)  and  x  and  xq  are  in  (a,  b)  then,  for  every 
integer  n  >  0, 

f(x)-Tn(x)=  J—— \£L(x-x0)"-\  (4.5.17) 
(n  +  1)! 

where  cn  is  between  x  and  xq.  Therefore, 

z— '  «! 

«=o 

for  an  .r  in  {a,  b)  if  and  only  if 

f  («+!)/>  ^ 

It  is  not  always  easy  to  check  this  condition,  because  the  sequence  {c„}  is  usually  not  pre- 
cisely known,  or  even  uniquely  defined;  however,  the  next  theorem  is  sufficiently  general 
to  be  useful. 

Theorem  4.5.9  Suppose  that  f  is  infinitely  differentiable  on  an  interval  I  and 

lim  !_||/W||Z  =  0.  (4.5.18) 

Then,  ifxo  €  7°,  the  Taylor  series 

'—'  n\ 

n=0 

converges  uniformly  to  f  on 

Ir  =  I  n  [x0  -  r,  x0  +  r\. 

Proof   From  (4.5.17), 

r«+i  r«+i 

\\f-Tn\\Ir  <  r— rrll /(n+1) II/,  <  r— tttII/^II/, 

(«  +  1)!  («  +  1)! 

so  (4.5.18)  implies  the  conclusion.  H 

Example  4.5.8  If  f(x)  =  sin x,  then  \\f    ||(— 00,00)  = 

1,  k  >  0.  Since 

r" 

lim  — -  =  0,  0<r<oo 
n— >oo  n ! 
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(Example  4.1.12),  (4.5.18)  holds  for  all  r.  Since 

/(2m)(0)  =  0    and    /(2m+1)(0)  =  (-l)m,  m>0, 

we  see  from  Theorem  4.5.9,  with  /  =  (-co,  oo),  xq  =  0,  and  r  arbitrary,  that 

00  x2n+1 

sinx  =  >  (— 1)"  ,     — oo  <  x  <  oo, 

^         (2n  +  1) 

and  the  convergence  is  uniform  on  bounded  sets. 
A  similar  argument  shows  that 

°°  x2n 

cosx  =  V(-l)"— — ,     -oo  <  X  <  oo, 

^-^  (2«) 
n=o         v  7 

with  uniform  convergence  on  bounded  sets. 

Example  4.5.9  If  f(x)  =  ex,  then  f(k)(x)  =  ex  and  =  er ,  k  >  0,  if 

/  =  [— r,  r\.  Since 

lim  —  er  =  0. 

we  conclude  as  in  Example  4.5.8  that 

n=0 

with  uniform  convergence  on  bounded  sets. 
Example  4.5.10  If  f(x)  =  (1  +  x)q ,  then 


-oo  <  X  <  oo, 


/WW      <9Xl+xr,    so  (4.5.19) 


9  W" 


(Example  2.5.3).  The  Maclaurin  series 

oo 

E 

«=o 

is  called  the  binomial  series.  We  saw  in  Example  2.5.3  that  this  series  equals  (1  +  x)q  for 
all  x  if  q  is  a  nonnegative  integer.  We  will  now  show  that  if  q  is  an  arbitrary  real  number, 
then 

]T  [q  \xn  =  f{x)  =  (1  +  x)q,    0<x<l.  (4.5.20) 

n=0  W 

Since 
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lim 

n— >oo 


[n  +  1 


=  lim 


q—n 


n  +  1 


=  1, 


the  radius  of  convergence  of  the  series  in  (4.5.20)  is  1.  From  (4.5.19), 

ll[o,i] 


||y(«)  I 


<  [max(l,2g)] 


1 


,    n  >  0. 


Therefore,  if  0  <  r  <  1, 


lim  _||/W||[0>1]  <  [max(l,2«)]  lim 


r"  =  0, 


where  the  last  equality  follows  from  the  absolute  convergence  of  the  series  in  (4.5.20)  on 
(-1,  1).  Now  Theorem  4.5.9  implies  (4.5.20).  ■ 

We  cannot  prove  in  this  way  that  the  binomial  series  converges  to  (1  +  x)q  on  (—1,0). 
This  requires  a  form  of  the  remainder  in  Taylor's  theorem  that  we  have  not  considered,  or 
a  different  kind  of  proof  altogether  (Exercise  4.5.20).  The  complete  result  is  that 


n=Q 


(1  +X)4  =  lq \x",  -1<X<1 


(4.5.21) 


for  all  q,  and,  as  we  said  earlier,  the  identity  holds  for  all  x  if  q  is  a  nonnegative  integer. 
Arithmetic  Operations  with  Power  Series 

We  now  consider  addition  and  multiplication  of  power  series,  and  division  of  one  by  an- 
other. 

We  leave  the  proof  of  the  next  theorem  to  you  (Exercise  4.5.21). 
Theorem  4.5.10  // 


LA., 

f{x)  =  ^a„(x-x0)n,  \x-x0\<R\, 

n=0 

oo 

g(x)  =  ^bn(x  -x0)n,  \x-x0\<R2, 

n=0 

and  a  and  ft  are  constants,  then 

oo 

af{x)  +  Pg(x)  =  ^2(aan  +  pbn)(x  -  x0)n,     \x  -  x0\  <  R, 

n=0 

where  R  >  min{i^i,  i?2}- 


(4.5.22) 


(4.5.23) 
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Theorem  4.5.11  // /  and  g  are  given  by  (4.5.22)  and  (4.5.23),  then 

00 

f(x)g{x)=YJCn(x-x0)\     \x-x0\<R,  (4.5.24) 

«=0 

n  n 

where  1     •  ST^  , 

cn  =  2_^arbn^r  =  2_^a„-rbr 

r=0  r=0 

and  R  >  min{.Ri,  i^}- 

Proof  Suppose  that  R\  <  i?2-  Since  the  series  (4.5.22)  and  (4.5.23)  converge  abso- 
lutely to  / (x)  and  g(x)  if  |x  —  Xo\  <  Ri,  their  Cauchy  product  converges  to  / (x)g(x)  if 
\x  —  xq\  <  Ri,  by  Theorem  4.3.29.  The  «th  term  of  this  product  is 


n  /  n  \ 

^ar(x  -  x0)rbn-r(x  -  x0)n~r  =  I  ^2arb„-r  J  (x  -  x0)n  =  c„(x  -  x0)n . 

r=0  \r=0  / 


TJ 


Example  4.5.11  If 

1  00 

/to  =  - —  =  i-Ti<1' 

1  —  x  '—' 


n=0 


and 


then 


g(x)  =  YJbnxn,  \x\<R, 


n=0 


l-x 


=  ^i„x",     \x\  <  min{l,  R}, 


where 


s„  =  (1)60  +  (l)6i  +  —  +  0)bn 
=  b0  +  bi+---  +  b„. 

Example  4.5.12  From  the  paragraph  following  Example  4.5.10, 


n=0 

and 


(i+xy  =  jr/(p)xn,  \x\<i, 

n=0  \  / 


(i+jt)«  =  f;(y,  w<i. 
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n=0 


n 


-»-E  L  )(„_, 


r=0 


Since 

oo     /  \ 

(1  +         +  *)«  =  (1  +  xY+o  =         P  ' 

while  the  Cauchy  product  is  Yl^n=Q  cnXn ,  with 

«    /  \ 

p  \  I  q 


Corollary  4.5.7  implies  that 

This  yields  the  identity 

valid  for  all  p  and  q . 
The  quotient 


P+9 

,  n 


p  +  q\         p\  i 


E 


n     I  1 — '  \  r    \n  —  r 


f(x)  =  ^  (4-5.25) 


of  two  power  series 


and 


CO 

h(x)  =  ^2c„(x  -  x0)n ,  \x-x0\<R1 


n=0 


g(x)  =  ^2  b„(x  -  x0)n ,  \x-x0\<R2, 


n=0 

can  be  represented  as  a  power  series 

oo 

f(x)=J2an(x-x0)n  (4.5.26) 

n=0 

with  a  positive  radius  of  convergence,  provided  that 

b0  =  g(x0)  ^  0. 

This  is  surely  plausible.  Since  g(xo)  ^  0  and  g  is  continuous  near  xq,  the  denominator 
of  (4.5.25)  differs  from  zero  on  an  interval  about  xq.  Therefore,  /  has  derivatives  of  all 
orders  on  this  interval,  because  g  and  h  do.  However,  the  proof  that  the  Taylor  series  of  / 
about  Xo  converges  to  /  near  xq  requires  the  use  of  the  theory  of  functions  of  a  complex 
variable.  Therefore,  we  omit  it.  However,  it  is  straightforward  to  compute  the  coefficients 
in  (4.5.26)  if  we  accept  the  validity  of  the  expansion.  Since 

f(x)g(x)  =  h(x), 
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Theorem  4.5.1 1  implies  that 

n 

^2<2rbn-r  =  c„,    n  >  0. 

r=0 

Solving  these  equations  successively  yields 

Co 

tfo  =  r-, 
bo 

l    /        n-l  \ 
an  =  y  Un-J2bn-rar\,  «>1. 


It  is  not  worthwhile  to  memorize  these  formulas.  Rather,  it  is  usually  better  to  view  the 
procedure  as  follows:  Multiply  the  series  /  (with  unknown  coefficients)  and  g  according 
to  the  procedure  of  Theorem  4.5.1 1,  equate  the  resulting  coefficients  with  those  of  h,  and 
solve  the  resulting  equations  successively  for  ao,  a\,  

Example  4.5. 13  Suppose  that  we  wish  to  find  the  coefficients  in  the  Maclaurin  series 

tanjc  =  ao  +  a\x  +  ajx2  +  ■  ■  ■  . 

We  first  observe  that  since  tan  x  is  an  odd  function,  its  derivatives  of  even  order  vanish  at 
Xo  =  0,  so  ci2m  =  0,  m  >  0.  Therefore, 

tanx  =  a\x  +  a^x3  +  CI5X5  +  •■•  . 

Since 

sin* 
tan*  =   , 


cos  x 


it  follows  from  Example  4.5.8  that 


x  1  h 

3  ,  „  „5  ,  6  120 


CL\X  +  CI3X    +  CI5X  + 


Y+24  + 


so 


f         X  X  \  X  X 

(aix  +  a3x3  +  a5x5  +  ...)M-_  +  _  +  ...\=x-  —  +  —  + 


or,  according  to  Theorem  4.5.1 1, 

3  5 

c  x 

24/"  6~  +  120 


/        a\\    ,     /        «3      ai\    s  x3  x5 

a\x  +  ( c/3  I  x  +  ( as  1  I  x  H  =  x  1  1  . 

V         2  /         V         2      24/  6  120 


From  Corollary  4.5.7,  coefficients  of  like  powers  of  x  on  the  two  sides  of  this  equation 
must  be  equal;  hence, 

a\  1  A3      a\  1 

fli  =  1,  03  =  — ,  a5  1  =  , 

2  6  2      24  120 

so 

2 


111  11 n\  1 

ay  =  1,    a3  =  1--(1)=  -,     05  =  h  -  (1)  = 

6     2  3  120     2  \3j     24  J 


15 
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Therefore, 

x3      2  J 

tan  X  —  x  -\  1  x  +  ■•■  . 

3  15 

Example  4.5.14  To  find  the  reciprocal  of  the  power  series 

oo  „ 

g(x)  =  l  +  ex=2  +  J2X 

we  let  h  =  1  in  (4.5.25).  If 

— — -  =  Y\a„xn, 


«=i 


then 


n=0 


/  X2        X3  \ 

1  =  (ao  +  ai*  +        +  fl3-^3  H  M2  +  'x  +  ^~  +  :6""'  ) 


=  2a0  +  (a0  +  2a\)x  +  ^—  +  ci\  +  2a2j 
+  (y +  y  +a2  +  2a3) 
From  Corollary  4.5.7, 


A"2 


\X3  +■ 


2uq  =  1, 
ao  +  2fli  =  0, 

ao 

—  +  a\  +  2a2  =  0, 

CLq  Cl\ 

—  +  —  +  a2  +  2a3  =  0. 
o  2 


Solving  these  equations  successively  yields 


a0 


1 

2' 

££.  -  _I 

"2  ~  4' 


a2  =  -I(f +fll)=-i(i-I)=0, 

fl3  =  -2(T  +  T+fl2)=-2(l2-8+0 


1 

48' 


1     x  x3 

r  -  T  +  TTT  + 


1  +  e*      2     4  48 
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x" 
hT 


Example  4.5.15  To  find  the  reciprocal  of 

g(x)  =e*  =  J2- 
we  again  let  h  =  1  in  (4.5.25).  If 

CO 

«=o 

then 

/  oo  \    /  OO      „  \  00 


;;=0 


where 


r=0  v  7 


From  Corollary  4.5.7,  Co  =  «o  =  1  and  c„  =  0  if  n  >  1;  hence, 


a„  =-J2 


ar 


r=0 


(n  —  r)\ 


n  >  1. 


Solving  these  equations  successively  for  ao,  a\,  ■■■  yields 
1 


Cl\ 

a2  =  - 

a3  =  - 

«4  =  — 
From  this,  we  see  that 


1! 


(4.5.1)  =  -1. 

1  111 

_(1) +  _(-!)]=_, 

+  ^  +  If  G 


i       i         i  /n 

4!(1)+3!(-1)+2!    2  + 


1f~ 
1!  V  6 


(-1)* 


1 

24' 


(4.5.27) 


(4.5.28) 


for  0  <  /c  <  4  and  are  led  to  conjecture  that  this  holds  for  all  k.  To  prove  this  by  induction, 
we  assume  that  it  is  so  for  0  <  k  <  n  —  1  and  compute  from  (4.5.28): 

^      1  (-l)r 

a„  =-)  — — 

"  (n  -  r)  r 

r=0  v  ' 
n-1 


r=0 
(-1)" 


-}  ^(-l)r  ( "  J      (Exercise  1.2.19(a)) 
(Exercise  1.2.19(b)). 
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Thus,  we  have  shown  that 

oo  „ 

(^r1  =  £(-if  V 

n=0 

Since  this  is  precisely  the  series  that  results  if  x  is  replaced  by  —  x  in  (4.5.27),  we  have 
verified  a  fundamental  property  of  the  exponential  function:  that 

(exyx  =  e~x. 

This  also  follows  from  Example  4.3.26. 

Abel's  Theorem 

From  Theorem  4.5.4,  we  know  that  a  function  /  defined  by  a  convergent  power  series 

oo 

f{x)  =  J2an(x-x0)n,     \x-x0\<R,  (4.5.29) 

«=0 

is  continuous  in  the  open  interval  (xq  —  R,  xq  +  R).  The  next  theorem  concerns  the  behavior 
of  /  as  x  approaches  an  endpoint  of  the  interval  of  convergence. 

Theorem  4.5.12  (Abel's  Theorem)  Let  f  be  defined  by  a  power  series  (4.5.29) 
with  finite  radius  of  convergence  R. 

(a)  V  IZnLo  anRn  converges,  then 

oo 

lim      f{x)  =  y^anRn. 
*-K*o+*)-  *-L 

n=0 

(b)  If^^Loi—  l)nanRn  converges,  then 

oo 

lim      fix)  =  Yi-\)na„Rn. 

«=0 

Proof   We  consider  a  simpler  problem  first.  Let 

oo 

g(y)  =  J2b»y" 

n=0 

and 


^  bn  =  s  (finite). 


«=o 

We  will  show  that 


lim  giy)  =  s.  (4.5.30) 
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From  Example  4.5.1 1, 

oo 
n=0 

where 

sn  =  b0  +  b\  H  Ybn. 

Since 


(4.5.32) 


j  OO  OO 

  =  V*  y»    and  therefore     1  =  (1  -  y)  V  y",     |y|  <  1, 

1  —  v      ^— '  ' 

^        «=0  n=0 

we  can  multiply  through  by  s  and  write 

OO 

n=0 

Subtracting  this  from  (4.5.31)  yields 

OO 

g(y)-s  =  (l-y)J2(sn-s)yn,  |y|<l. 

n=0 

If  e  >  0,  choose  N  so  that 

\sn-s\  <  e    if   n  >  N  +  I. 

Then,  if  0  <  y  <  1, 

JV  oo 

igoo-*i<(i-j)I>«-*ij"  +  (i-j)  £ 

n=0  n=N  +  l 

N  oo 

<  a  -  y)    \s»  -  s\y"  +  o  -        E  y" 

«=0  n=0 

iV 

<  (i-jo^k-si +  e, 

because  of  the  second  equality  in  (4.5.32).  Therefore, 

\g(y)-s\  <2e 

if 

N 

This  proves  (4.5.30). 

To  obtain  (a)  from  this,  let  bn  =  anRn  and  g(y)  =  f(xo  +  Ry);  to  obtain  (b),  let 
bn  =  (-l)"anRn  and  g(y)  =  f(xo-Ry).  U 
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Example  4.5.16  The  series 

1  00 

f(x)  =  -  =  V(-1)"jc" 

1  +  x 


n=0 


diverges  at  x  =  1,  while  limx_^i_  f(x)  =  1/2.  This  shows  that  the  converse  of  Abel's 
theorem  is  false.  Integrating  the  series  term  by  term  yields 


loga+x^yvir— T,  i*i<i, 


where  the  power  series  converges  at  x  =  1,  and  Abel's  theorem  implies  that 

(_!)»  +  ! 


n=0 


n  +  1 


Example  4.5.17  If  q  >  0,  the  binomial  series 


«=o 


converges  absolutely  for  x  =  ±  1 .  This  is  obvious  if  g  is  a  nonnegative  integer,  and  it 
follows  from  Raabe's  test  for  other  positive  values  of  q,  since 


'  q 
lit  + 1 


n  —  q 
n  +  1 ' 


«  > 


and 


lim  n  (     "+1   —  1  )  =  lim  n  \   —  —  1  ) 

«->oo     \     a„  J       n^oo     \n  +  1  J 


=  lim 


«^oo  n  +  1 

Therefore,  Abel's  theorem  and  (4.5.21)  imply  that 


{-q-\)  =  -q-\. 


«=o 


«=0 


4.5  Exercises 


1.  The  possibilities  listed  in  Theorem  4.5.2(c)  for  behavior  of  a  power  series  at  the 
endpoints  of  its  interval  of  convergence  do  not  include  absolute  convergence  at  one 
endpoint  and  conditional  convergence  or  divergence  at  the  other.  Why  can't  these 
occur? 
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2.  Find  the  radius  of  convergence. 

(*)^fcL±±\     [2 +  (-!)"]"  x"  (b)^2^-l)» 

WES" 

3.  (a)   Prove:  If  {a„rn}  is  bounded  and  |xi  —  xq\  <  r,  then  ^an{x\  —  xq)"  con- 

verges. 

(b)   Prove:  If  *^an(x  —  xq)"  has  radius  of  convergence  R  and  \x\  —  Xo\  >  R, 
then  {a„  (xi  —  xq)"  }  is  unbounded. 

4.  Prove:  If  g  is  a  rational  function  defined  for  all  nonnegative  integers,  then  £  anxn 
and     Ong(n)xn  have  the  same  radius  of  convergence.  Hint:  Use  Exercise  4.1.30(a). 

5.  Suppose  that  f(x)  =  £a„(x  —  xo)"  has  radius  of  convergence  R  and  0  <  r  < 
R\  <  R.  Show  that  there  is  an  integer  k  such  that 

k 


f(x)  -  ^a„(x  -xo)" 


n=0 


#1 


Rx-r 


if  [jc  —  jco |  <  r  and  k  >  k. 
6.     Suppose  that  £  is  a  positive  integer  and 


fix)  =  J2anx" 

n=0 

has  radius  of  convergence  R.  Show  that  the  series 

oo 

g(X)  =  f(xk)=J2^nXk" 
n=0 

has  radius  of  convergence  R1'*. 

7.  Complete  the  proof  of  Theorem  4.5.3  by  showing  that 

(a)  R  =  0iflim„-Hx,  \a„+i\/\a„\  =  oo; 

(b)  R  =  oo  if  lirrjn-Hx,  |a„+i|/|a„|  =  0. 

8.  Find  the  radius  of  convergence, 
(a)  £(log«)x" 

f2n 
n 


(b)  £2V(x  +  1)" 
«2  +  1 


(d)EH)^(-ir 
(f)E"'" 


a  (a  +  1)  •  ■  •  (a  +  n  -  1) 


/SOS  +  1)---0S  +  n  -  1) 
(a,  /3     negative  integer) 
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9.     Suppose  that  an  ^  0  for  n  sufficiently  large.  Show  that 


(a)  lim 


<  lim  \an\xln    and(b)      lim  \an\1/n  <  lim 


l/n 


Show  that  this  implies  Theorem  4.5.3. 
10.     Given  that 

1 


1  -x 


«=0 


use  Theorem  4.5.4  to  express  ^^L0  «  x"  in  closed  form. 
1 1 .     The  function 


~        (_!)"  /M2»+f 

^  n\(n  +  p)\  \2/ 


is  the  Bessel  function  of  order  p.  Show  that 

(a)  = 

(b)  J'p  =  /,>  1. 

(c)  x2/;  +  x/;  +  (x2-,p2)/p  =  0. 

12.  Given  that  the  power  series  /(x)  =  Y^T=o  anXn  satisfies 

f'(x)  =  -2x/(x),    /(0)  =  1, 

find         Do  you  recognize  /? 

13.  Let 


and  g-(x)  =  / (x  ),  where  k  is  a  positive  integer.  Show  that 

g(r)(0)  =  0    if    r^kn    and    g(kn)  (Q)  =  ^-^/(n)(0).  n>(). 

14.  Let 


fix)  =      a„jx  -  x0)n,  \x-xo\<R, 


n=0 

and  f(tn)  =  0,  where  tn  ^  xq  and  lim„^oo  tn  =  xq.  Show  that  fix)  =  0 
(|x  —  Xo|  <  R).  HINT:  Rolle's  theorem  helps  here. 

15.  Prove  Theorem  4.5.8. 

16.  Express 

log* 
M  f-1 


i: 


-dt 


as  a  power  series  in  x  —  1  and  find  the  radius  of  convergence  of  the  series. 
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17.  By  substituting  —  x2  for  x  in  the  geometric  series,  we  obtain 

1  00 
-—j  =  |x|<l. 

Use  this  to  express  /(x)  =  Tan_1x  (f(0)  =  0)  as  a  power  series  in  x.  Then 
evaluate  all  derivatives  of  /  at  xq  =  0,  and  find  a  series  of  constants  that  converges 
to  it/ 6. 

18.  Prove:  If 

oo 

/(*)  =/,<*n(x-  Xp)n,  \X-X0\<R, 
n=0 

and  F  is  an  antiderivative  of  /  on  (xq  —  R,  xq  +  R),  then 

oo 

F(x)  =  C +  Y^-(x-x0)n+1,  \x-x0\<R, 
f— '  n  +  1 

n=0 

where  C  is  a  constant. 

19.  Suppose  that  some  derivative  of  /  can  be  represented  by  a  power  series  in  x  —  xq 
in  an  interval  about  xq.  Show  that  /  and  all  its  derivatives  can  also. 

20.  Verify  Eqn.  (4.5.21)  by  showing  that 

n=0  W 

HINT:  Differentiate. 

21.  Prove  Theorem  4.5.10. 

22.  Find  the  Maclaurin  series  of  cosh  x  and  sinhx  from  the  definition  in  Eqn.  (4.5.16), 
and  also  by  applying  Theorem  4.5.10  to  the  Maclaurin  series  for  ex  and  e~x . 

23.  Give  an  example  where  the  radius  of  convergence  of  the  product  of  two  power  series 
is  greater  than  the  smaller  of  the  radii  of  convergence  of  the  factors. 

24.  Use  Theorem  4.5.1 1  to  find  the  first  four  nonzero  terms  in  the  Maclaurin. 

e  cos  x 

(a)  exsinx     (b)  — — -     (c)        —     (d)  (sinx)log(l  +  x) 
1  +  x2-  1  +  x° 

25.  Derive  the  identity 

2  sin  x  cos  x  =  sin2x 
from  the  Maclaurin  series  for  sinx,  cos  x,  and  sin2x. 

26.  (a)   Given  that 

OO 

{\-2xt  +  x2)~1'2  =  J2pn(t)xn,     \x\<l,  (A) 
«=o 
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if — 1  <  t  <  1,  show  that  P0(t)  =  1,  Pi(0  =  f,  and 

2n  +  I  n 

«  +  1  n  +  1 

HINT:  firs?  differentiate  (A)  w/f/i  respect  to  x. 

(b)  Show  from  (a)  that  P„  is  a  polynomial  of  degree  n.  It  is  the  «th  Legendre 
polynomial,  and  (1  —  2xf  +  x2)~ll2  is  the  generating  function  of  the  sequence 
{^}- 

27.  Define  (if  necessary)  the  given  function  so  as  to  be  continuous  at  Xq  =  0,  and  find 
the  first  four  nonzero  terms  of  its  Maclaurin  series. 

xex  .  .  cosjc 

(a)    (b)   -  (c)  sec* 

V  '  sin*  V  '  1  +  Jt  +  x2  V  ' 

sin2x 

(a)  xcscx  (e)   

sinx 

28.  Let  ao  =  a\  =  5  and  an+i  =  a„  —  6a„-i,  n  >  1. 

(a)  Express  F(;c)  =  J^^o  ^n*"  m  closed  form. 

(b)  Write  F  as  the  difference  of  two  geometric  series,  and  find  an  explicit  formula 
for  a„ . 

29.  Starting  from  the  Maclaurin  series 

00    xn  +  l 

log(l-x)  =  -V— -,     |*|  <1, 
„=o  "  +  1 

use  Abel's  theorem  to  evaluate 

oo  . 

T  • 

{n  +  i)(„  +  2) 

30.  In  Example  4.5.17  we  saw  that 


n=0 


Show  that  this  also  holds  for  —  1  <  q  <  0,  but  not  for  q  <  —  1.  Hint:  See  &er- 
cise  4.1.35. 

31.     (a)   Prove:  If  Yllv=o  bn  converges,  then  the  series  g{x)  =  Yln=o  bnx"  converges 
uniformly  on  [0,  1].  Hint:  Ife  >  0,  there  is  an  integer  N  such  that 

\b„+bn+i-\  \-bm\<€    if  n,m>N. 

Use  summation  by  parts  to  show  that  then 

\b„xn  +b„-ix"-1  +  ■■■  +  bmxm\  <  2e    if    0  <  x  <  1,  n,m>N. 

This  is  also  known  as  Abel 's  theorem. 
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(b)    Show  that  (a)  implies  the  restricted  form  of  Theorem  4.5.12  (concerning  g) 
proved  in  the  text. 

32.     Use  Exercise  4.5.31  to  show  that  if  }^^=o       Y^T=o      an^  tneir  Cauchy  product 
Y^T=o  cn  a^  converge,  then 


33.     Prove:  If 


(OO  \      /  OO  \  00 

J2an)[J2bn)  =  J2Cn- 
n=0      I    \n=0      /  n=0 


g(X)  =  Y^bnX\  \X\<\, 


n=0 


and  bn  >  0,  then 


£>„  =   lim  g(x)    (finite  or  infinite). 


34.     Use  the  binomial  series  and  the  relation 

-^-(sin"1*)  =  (1  -x2y1/2 
ax 

to  obtain  the  Maclaurin  series  for  sin-1  x  (sin-1  0=0).  Deduce  from  this  series 
and  Exercise  4.5.33  that 


£ 

n=0 


'2n\  1 


n    22n(2n  +  1)  2 


CHAPTER  5 


Real- Valued  Functions 
of  Several  Variables 


IN  THIS  CHAPTER  we  consider  real-valued  function  of  n  variables,  where  n  >  1 . 

SECTION  5 . 1  deals  with  the  structure  of  K" ,  the  space  of  ordered  n  -tuples  of  real  numbers, 
which  we  call  vectors.  We  define  the  sum  of  two  vectors,  the  product  of  a  vector  and  a 
real  number,  the  length  of  a  vector,  and  the  inner  product  of  two  vectors.  We  study  the 
arithmetic  properties  of  W,  including  Schwarz's  inequality  and  the  triangle  inequality.  We 
define  neighborhoods  and  open  sets  in  W ,  define  convergence  of  a  sequence  of  points  in 
M",  and  extend  the  Heine-Borel  theorem  to  M".  The  section  concludes  with  a  discussion 
of  connected  subsets  of  W . 

SECTION  5.2  deals  with  boundedness,  limits,  continuity,  and  uniform  continuity  of  a  func- 
tion of  n  variables;  that  is,  a  function  defined  on  a  subset  of  W . 

SECTION  5.3  defines  directional  and  partial  derivatives  of  a  real-valued  function  of  n 
variables.  This  is  followed  by  the  definition  of  differentiablity  of  such  functions.  We  define 
the  differential  of  such  a  function  and  give  a  geometric  interpretation  of  differentiablity. 

SECTION  5.4  deals  with  the  chain  rule  and  Taylor's  theorem  for  a  real-valued  function  of 
n  variables. 


5.1  STRUCTURE  OF  Rn 

In  this  chapter  we  study  functions  defined  on  subsets  of  the  real  n -dimensional  space  W, 
which  consists  of  all  ordered  «-tuples  X  =  (x\,  Xi, . . . ,  x„)  of  real  numbers,  called  the 
coordinates  or  components  of  X.  This  space  is  sometimes  called  Euclidean  n-space. 

In  this  section  we  introduce  an  algebraic  structure  for  W .  We  also  consider  its  topologi- 
cal properties;  that  is,  properties  that  can  be  described  in  terms  of  a  special  class  of  subsets, 
the  neighborhoods  in  R" .  In  Section  1 .3  we  studied  the  topological  properties  of  M.1 ,  which 
we  will  continue  to  denote  simply  as  ML  Most  of  the  definitions  and  proofs  in  Section  1 .3 
were  stated  in  terms  of  neighborhoods  in  R..  We  will  see  that  they  carry  over  to  W  if  the 
concept  of  neighborhood  in  M."  is  suitably  defined. 
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Members  of  R  have  dual  interpretations:  geometric,  as  points  on  the  real  line,  and  alge- 
braic, as  real  numbers.  We  assume  that  you  are  familiar  with  the  geometric  interpretation 
of  members  of  R2  and  R3  as  the  rectangular  coordinates  of  points  in  a  plane  and  three- 
dimensional  space,  respectively.  Although  R"  cannot  be  visualized  geometrically  if  n  >  4, 
geometric  ideas  from  R,  R2,  and  R3  often  help  us  to  interpret  the  properties  of  R"  for 
arbitrary  n . 

As  we  said  in  Section  1.3,  the  idea  of  neighborhood  is  always  associated  with  some 
definition  of  "closeness"  of  points.  The  following  definition  imposes  an  algebraic  structure 
on  R",  in  terms  of  which  the  distance  between  two  points  can  be  defined  in  a  natural  way. 
In  addition,  this  algebraic  structure  will  be  useful  later  for  other  purposes. 

Definition  5.1.1  The  vector  sum  of 

X  =  (xi,x2,  ...,xn)    and    Y  =  (ji,y%,  ...,yn) 

is 

X  +  Y  =  (xi  +  yi,x2  +  y2,...,xn  +  yn).  (5.1.1) 
If  a  is  a  real  number,  the  scalar  multiple  o/X  by  a  is 

aX  =  (ax\,ax2, . . .  ,axn).  (5.1.2) 

■ 

Note  that  "+"  has  two  distinct  meanings  in  (5.1.1):  on  the  left,  "+"  stands  for  the  newly 
defined  addition  of  members  of  R"  and,  on  the  right,  for  addition  of  real  numbers.  However, 
this  can  never  lead  to  confusion,  since  the  meaning  of  "+"  can  always  be  deduced  from 
the  symbols  on  either  side  of  it.  A  similar  comment  applies  to  the  use  of  juxtaposition  to 
indicate  scalar  multiplication  on  the  left  of  (5.1.2)  and  multiplication  of  real  numbers  on 
the  right. 

Example  5.1.1  In  R4,  let 

X  =  (1,-2,  6,  5)    and    Y  =  (3,  -5, 4,  j)  . 

Then 

X  + Y=  (4,-7,10,^-) 

and 

6X  =  (6,-12,36,30).  ■ 
We  leave  the  proof  of  the  following  theorem  to  you  (Exercise  5.1.2). 
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Theorem  5.1.2  //X,  Y,  andX  are  in  W  anda  and  b  are  real  numbers,  then 

(a)  X  +  Y  =  Y  +  X  {vector  addition  is  commutative). 

(b)  (X  +  Y)  +  Z  =  X  +  (Y  +  Z)  (vector  addition  is  associative). 

(c)  There  is  a  unique  vector  0,  called  the  zero  vector,  such  that  X  +  0  =  Xfor  all  X  in 
W. 

(d)  For  each  X  in  W  there  is  a  unique  vector  —X  such  that  X  +  (—X)  =  0. 

(e)  a(bX)  =  (ab)X. 

(f)  (a  +  b)X  =  aX  +  bX. 

(g)  a(X  +  Y)  =  aX  +  aY. 

(h)  IX  =  X. 

Clearly,  0  =  (0, 0, . . . ,  0)  and,  if  X  =  (xi,x2, . . . ,  x„),  then 

-X=  (-xu-x2, . . .  ,-xn). 

We  write  X  +  (— Y)  as  X  —  Y.  The  point  0  is  called  the  origin. 

A  nonempty  set  V  =  {X,  Y,  Z, . . . },  together  with  rules  such  as  (5.1.1),  associating  a 
unique  member  of  V  with  every  ordered  pair  of  its  members,  and  (5.1.2),  associating  a 
unique  member  of  V  with  every  real  number  and  member  of  V ,  is  said  to  be  a  vector  space 
if  it  has  the  properties  listed  in  Theorem  5.1.2.  The  members  of  a  vector  space  are  called 
vectors.  When  we  wish  to  emphasize  that  we  are  regarding  a  member  of  W  as  part  of  this 
algebraic  structure,  we  will  speak  of  it  as  a  vector;  otherwise,  we  will  speak  of  it  as  a  point. 

Length,  Distance,  and  Inner  Product 

Definition  5.1.3  The  length  of  the  vector  X  =  (x\,  x%, . . . ,  xn)  is 

\X\  =  (xl+xl  +  ---  +  xl)1'2. 

The  distance  between  points  X  and  Y  is  |X  —  Y|;  in  particular,  |X|  is  the  distance  between 
X  and  the  origin.  If  |X|  =  1,  then  X  is  a  unit  vector.  ■ 

If  n  =  1,  this  definition  of  length  reduces  to  the  familiar  absolute  value,  and  the  distance 
between  two  points  is  the  length  of  the  interval  having  them  as  endpoints;  for  n  =  2  and 
n  =  3,  the  length  and  distance  of  Definition  5.1.3  reduce  to  the  familiar  definitions  for  the 
plane  and  three-dimensional  space. 

Example  5.1.2  The  lengths  of  the  vectors 

X  =  (1,-2,  6,  5)    and  Y=(3,-5,4,i) 

are 

|X|  =  (l2  +  (-2)2  +  62  +  52)1/2  =  V66 
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and   

-v/20T 

|Y|  =  (32  +  (-5)2  +  42  +  (i)2)1/2  =  — — . 

z  2 

The  distance  between  X  and  Y  is 

|X  -  Y|  =  ((1  -  3)2  +  (-2  +  5)2  +  (6  -  4)2  +  (5  -  ±)2)1/2  =  1— . 

z  2 

Definition  5.1.4The  inner productX-Y  of  X  =  (x\,  X2,  ■  ■  ■ ,  xn)  and  Y  =  (ji,  _y2,  •  •  • ,  yn) 
is 

X  Y  =  xxyx  +x2y2-\  h  x„y„. 

Lemma  5.1.5  (Schwarz's  Inequality)  IfX  and  \  are  any  two  vectors  in  W, 
then 

|X-Y|  <  |X||Y|,  (5.1.3) 
with  equality  if  and  only  if  one  of  the  vectors  is  a  scalar  multiple  of  the  other. 

Proof   If  Y  =  0,  then  both  sides  of  (5.1.3)  are  0,  so  (5.1.3)  holds,  with  equality.  In  this 
case,  Y  =  OX.  Now  suppose  that        0  and  t  is  any  real  number.  Then 


0<  -tyi)2 

i  =  l 

=  £xf-2titxiyi+ti£yf  (5.1.4) 

(=1  i  =  l  (  =  1 

=  |X|2-2(X-  Y)t  +  f2|Y|2. 

The  last  expression  is  a  second-degree  polynomial  pint.  From  the  quadratic  formula,  the 
zeros  of  p  are 

_  (X  ■  Y)  ±  V(X- Y)2  -  |X|2|Y|2 

l-        w  ■ 

Hence, 

(X  •  Y)2  <  |X|2|Y|2,  (5.1.5) 

because  if  not,  then  p  would  have  two  distinct  real  zeros  and  therefore  be  negative  between 
them  (Figure  5.1.1),  contradicting  the  inequality  (5.1.4).  Taking  square  roots  in  (5.1.5) 
yields  (5.1.3)  if  Y  ^  0. 

If  X  =  tY,  then  |X  ■  Y|  =  |X||Y|  =  |f||Y|2  (verify),  so  equality  holds  in  (5.1.3). 
Conversely,  if  equality  holds  in  (5.1.3),  then  p  has  the  real  zero  to  =  (X  ■  Y)/|Y||2,  and 


i  =  l 

from  (5.1 .4);  therefore,  X  =  t0 Y.  U 


y 
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>■  t 


Figure  5.1.1 

Theorem  5.1.6  (Triangle  Inequality)  If 'X  and  Y  are  in  W,  then 

|X  +  Y|  <|X|  +  |Y|,  (5.1.6) 
with  equality  if  and  only  if  one  of  the  vectors  is  a  nonnegative  multiple  of  the  other. 

Proof   By  definition, 

n  n  n  n 

ix + yi2  =      +  yi  )2  =  J2  xf +zJ2x>  y<  +  E  yf 

;'  =  1  i  =  l  i  =  l  i  =  l 

=  |X|2  +  2(X-Y)+  |Y|2  (5.1.7) 
<  |X|2  +  2|X|  |Y|  +  |Y|2    (by  Schwarz's  inequality) 
=  (|X|  +  |Y|)2. 

Hence, 

|X  + Y|2  <  (|X|  +  |Y|)2. 

Taking  square  roots  yields  (5.1.6). 

From  the  third  line  of  (5.1.7),  equality  holds  in  (5.1.6)  if  and  only  if  X  •  Y  =  |X||Y|, 
which  is  true  if  and  only  if  one  of  the  vectors  X  and  Y  is  a  nonnegative  scalar  multiple  of 
the  other  (Lemma  5.1 .5).  TJ 

Corollary  5.1.7  IfX,  Y,  and  Z  are  in  W,  then 

|X-Z|  <  |X-Y|  +  |Y-Z|. 

Proof  Write 

X  -  Z  =  (X  -  Y)  +  (Y  -  Z), 
and  apply  Theorem  5. 1 .6  with  X  and  Y  replaced  by  X  —  Y  and  Y  —  Z.  TJ 
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Corollary  5.1.8  If X  and  Y  are  in  W1,  then 

|X-Y|>||X|-|Y||. 

Proof  Since 


Theorem  5.1.6  implies  that 
which  is  equivalent  to 
Interchanging  X  and  Y  yields 


X  =  Y  +  (X-Y), 
|X|<|Y|  +  |X-Y|, 
IXI-IYI  <  IX- Y|. 


|Y|-|X|  <  |Y-X|. 
Since  |X  —  Y|  =  |Y  —  X|,  the  last  two  inequalities  imply  the  stated  conclusion. 


TJ 


Example  5.1.3  The  angle  between  two  nonzero  vectors  X  =  (xi,x%,x-j)  andY  = 
(.Vi  >  yz>  yz)  m  l^3  is  tne  angle  between  the  directed  line  segments  from  the  origin  to  the 
points  X  and  Y  (Figure  5.1.2). 

X 

IXI 


IX-YI 


Figure  5.1.2 

Applying  the  law  of  cosines  to  the  triangle  in  Figure  5.1.2  yields 
|X-Y|2  =  |X|2+  |Y|2-2|X||Y|cos<9. 

However, 

|X  -  Y|2  =  (xt  -  yif  +  (x2  -  yif  +  (x3  -  ys)2 

=  (x2  +  x\  +  xl)  +  (y2  +  yl  +  y\)  -  2{xxyx  +  x2y2  +  x3y3) 
=  IXI2  -f-  |Y|2-2X-Y. 


(5.1.8) 
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Comparing  this  with  (5.1.8)  yields 

X-Y=  |X|  |Y|cos#. 
Since  |  cos  9\  <  1,  this  verifies  Schwarz's  inequality  in  R3. 

Example  5.1.4  Connecting  the  points  0,  X,  Y,  and  X  +  Y  in  R2  or  R3  (Figure  5.1.3) 
produces  a  parallelogram  with  sides  of  length  |X|  and  |  Y|  and  a  diagonal  of  length  |X  +  Y| . 


X 


Y 


Figure  5.1.3 

Thus,  there  is  a  triangle  with  sides  |X|,  |Y|,  and  |X  +  Y|.  From  this,  we  see  geometrically 
that 

|X  +  Y|  <  |X|  +  |Y| 

in  R2  or  R3,  since  the  length  of  one  side  of  a  triangle  cannot  exceed  the  sum  of  the  lengths 
of  the  other  two.  This  verifies  (5.1 .6)  for  R2  and  R3  and  indicates  why  (5.1 .6)  is  called  the 
triangle  inequality.  ■ 

The  next  theorem  lists  properties  of  length,  distance,  and  inner  product  that  follow  di- 
rectly from  Definitions  5.1.3  and  5.1.4.  We  leave  the  proof  to  you  (Exercise  5.1.6). 

Theorem  5.1.9  /fX,  Y,  and  Z  are  members  ofW  and  a  is  a  scalar,  then 

(a)  |aX|  =  |a||X|. 

(b)  |X|  >  0,  with  equality  if  and  only  z/X  =  0. 

(c)  |X  —  Y|  >  0,  with  equality  if  and  only  z/X  =  Y. 

(d)  XY  =  YX. 

(e)  X-(Y  +  Z)  =  X-Y  +  X-Z. 

(f)  (cX)  •  Y  =  X  -  (cY)  =c(X-Y). 
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Line  Segments  in  Kn 

The  equation  of  a  line  through  a  point  Xo  =  {xq,  yo,  zo)  in  IR3  can  be  written  parametri- 
cally  as 

X  =  Xo  +  U\t,      y  =  yo  +  U%t,      Z.  =  Z.0  +  Uj,t,      — OO  <  t  <  OO, 

where  m,  U2,  and  M3  are  not  all  zero.  We  write  this  in  vector  form  as 

X  =  X0  +  fU,    -oo<f<oo,  (5.1.9) 

with  U  =  (u\ ,  112,  M3),  and  we  say  that  the  line  is  through  Xo  in  the  direction  o/U. 
There  are  many  ways  to  represent  a  given  line  parametrically.  For  example, 

X=X0  +  sV,    -oo<s<oo,  (5.1.10) 

represents  the  same  line  as  (5.1.9)  if  and  only  if  V  =  c/U  for  some  nonzero  real  number  a. 
Then  the  line  is  traversed  in  the  same  direction  as  s  and  t  vary  from  —00  to  00  if  a  >  0,  or 
in  opposite  directions  if  a  <  0. 

To  write  the  parametric  equation  of  a  line  through  two  points  Xo  and  Xi  in  K3,  we  take 
U  =  Xi  -  0  in  (5.1.9),  which  yields 

X  =  X0  +  t (Xi  -  X0)  =  fXx  +  (1  -  f)X0,     — 00  <  t  <  00. 

The  line  segment  from  Xo  to  Xi  consists  of  those  points  for  which  0  <  t  <  1. 

Example  5.1.5  The  line  L  defined  by 

x  =  -\+2t,    y  =  3-4t,    z  =  — 1,    -00  <  t  <  00, 

which  can  be  rewritten  as 

X=  (-1,3,-1)  +  t(2,  -4,0),    -oo<f<oo,  (5.1.11) 

is  through  Xo  =  (—1,  3,  —1)  in  the  direction  of  U  =  (2,  —4,  0).  The  same  line  can  be 
represented  by 

X  =  (-1,3,-1)  +  5(1,-2,0),  -oo<i<oo,  (5.1.12) 

or  by 

X=  (-1,3,-1) +  t(-4,  8,0),  -oo<t<oo.  (5.1.13) 

Since 

(1,-2,0)  =  1(2,  -4,0), 

L  is  traversed  in  the  same  direction  as  t  and  s  vary  from  —00  to  00  in  (5.1.11)  and  (5.1.12). 
However,  since 

(-4,8,0)  =  -2(2,-4,0), 
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L  is  traversed  in  opposite  directions  as  t  and  r  vary  from  — oo  to  oo  in  (5.1.1 1)  and  (5.1.13). 

Setting  t  =  1  in  (5.1.11),  we  see  that  Xi  =  (1,  —1,  —1)  is  also  on  L.  The  line  segment 
from  Xo  to  Xi  consists  of  all  points  of  the  form 

X=  ?(l,-l,-l)  +  (l-0(-l,3,-l),    0<t  <  1. 

These  familiar  notions  can  be  generalized  to  R",  as  follows: 

Definition  5.1.10  Suppose  that  X0  and  U  are  in  R"  and  0.  Then  the  line  through 
Xo  in  the  direction  of  U  is  the  set  of  all  points  in  R"  of  the  form 

X  =  X0  +  tXS,    -oo  <  t  <  oo. 

A  set  of  points  of  the  form 

X  =  X0  +  t\J,    h<t<  t2, 

is  called  a  line  segment.  In  particular,  the  line  segment  from  Xo  to  Xi  is  the  set  of  points  of 
the  form 

X  =  X0  +  t(Xi  -  X0)  =  rXi  +  (1  -  t)X0,    0  <  t  <  1. 
Neighborhoods  and  Open  Sets  in  En 

Having  defined  distance  in  K",  we  are  now  able  to  say  what  we  mean  by  a  neighborhood 
of  a  point  in  R" . 

Definition  5.1.11  If  e  >  0,  the  e-neighborhood  of  a  point  Xo  inR"  is  the  set 

N€(X0)\  =  {X||X-X0|  <e}.  M 

An  e-neighborhood  of  a  point  Xo  in  R2  is  the  inside,  but  not  the  circumference,  of  the 
circle  of  radius  e  about  Xo.  In  R3  it  is  the  inside,  but  not  the  surface,  of  the  sphere  of  radius 
e  about  Xo . 

In  Section  1.3  we  stated  several  other  definitions  in  terms  of  6 -neighborhoods:  neigh- 
borhood, interior  point,  interior  of  a  set,  open  set,  closed  set, limit  point,  boundary  point, 
boundary  of  a  set,  closure  of  a  set,  isolated  point,  exterior  point,  and  exterior  of  a  set.  Since 
these  definitions  are  the  same  for  R"  as  for  R,  we  will  not  repeat  them.  We  advise  you  to 
read  them  again  in  Section  1.3,  substituting  R"  for  R  and  Xo  for  xq. 

Example  5.1.6  Let  S  be  the  set  of  points  in  R2  in  the  square  bounded  by  the  lines 
x  =  ±1,  y  =  ±1,  except  for  the  origin  and  the  points  on  the  vertical  lines  x  =  ±1 
(Figure  5.1.4,  page  290);  thus, 

S  =  {(x,  y)  |  (x,  y)  ^  (0, 0),  -1  <  jc  <  1,  -1  <  y  <  1}  . 
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Every  point  of  S  not  on  the  lines  y  =  ±  1  is  an  interior  point,  so 

S°  =  {(x,  y)  |  (x,  y)  ±  (0. 0),  -Kx,y<l}. 

S  is  a  deleted  neighborhood  of  (0.  0)  and  is  neither  open  nor  closed.  The  closure  of  S  is 

S  =  {(x,y)\  -l<x,y<  1}, 

and  every  point  of  S  is  a  limit  point  of  5.  The  origin  and  the  perimeter  of  S  form  35,  the 
boundary  of  S.  The  exterior  of  S  consists  of  all  points  (x,  y)  such  that  |x|  >  1  or  |j|  >  1. 
The  origin  is  an  isolated  point  of  Sc . 


-1-1). 


(1,1) 


Figure  5.1.4 

Example  5.1.7  If  Xo  is  a  point  in  W  and  r  is  a  positive  number,  the  open  n-ball  of 
radius  r  about  Xo  is  the  set  Br(Xo)  =  {X  |  |X  —  Xo|  <  r).  (Thus,  € -neighborhoods  are 
open  n-balls.)  If  Xi  is  in  5V(Xo)  and 

|X-Xi|  <€=  r-|X-Xo|, 

then  X  is  in  5r(Xo).  (The  situation  is  depicted  in  Figure  5.1.5  for  n  =  2.) 

Thus,  5r(Xo)  contains  an  e-neighborhood  of  each  of  its  points,  and  is  therefore  open. 
We  leave  it  to  you  (Exercise  5.1.13)  to  show  that  the  closure  of  Br(Xo)  is  the  closed  «-ball 
of  radius  r  about  Xo,  defined  by 
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Sr(X0)=  {X||X-X0|  <r}. 


Figure  5.1.5 


Open  and  closed  H-balls  are  generalizations  to  M"  of  open  and  closed  intervals. 

The  following  lemma  will  be  useful  later  in  this  section,  when  we  consider  connected 
sets. 

Lemma  5.1.12  IfS.\  andX.2  ore  in  Sr(Xo)  for  some  r  >  0,  then  so  is  every  point  on 
the  line  segment  from  Xi  to  X2. 

Proof   The  line  segment  is  given  by 

X  =  tX2  +  (1  -r)Xi,  0<r<l. 

Suppose  that  r  >  0.  If 

|Xi-X0|<r,  |X2-X0|<r, 

and  0  <  t  <  1,  then 

|X-Xo|  =  |JX2  +  (1-OXi-JX0-(1-OXo| 
=  |?(X2-X0)  +  (l-f)Xi-X0)| 
<?|X2-X0|  +  (l-0|Xi-X0| 

<  tr  +  (1  -i)r  =  r.  ^ 


The  proofs  in  Section  1.3  of  Theorem  1.3.3  (the  union  of  open  sets  is  open,  the  intersec- 
tion of  closed  sets  is  closed)  and  Theorem  1.3.5  and  its  Corollary  1.3.6  (a  set  is  closed  if 
and  only  if  it  contains  all  its  limit  points)  are  also  valid  in  R" .  You  should  reread  them  now. 
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The  Heine-Borel  theorem  (Theorem  1.3.7)  also  holds  in  R",  but  the  proof  in  Section  1.3 
is  valid  only  for  n  =  1.  To  prove  the  Heine-Borel  theorem  for  general  n,  we  need  some 
preliminary  definitions  and  results  that  are  of  interest  in  their  own  right. 

Definition  5.1.13  A  sequence  of  points  {Xr}  in  R"  converges  to  the  limit  X  if 

lim  |X,  -X|  =  0. 

In  this  case  we  write 

lim  Xr  =  X.  ■ 

r— >oo 

The  next  two  theorems  follow  from  this,  the  definition  of  distance  in  R" ,  and  what  we 
already  know  about  convergence  in  R.  We  leave  the  proofs  to  you  (Exercises  5.1.16  and 
5.1.17). 

Theorem  5.1.14  Let 

X  =  (x~i,x~2,  ■  ■  ■  ,x~n)    and    X,  =  (Xi,,  XiT, . . . ,  Xnr),     T  >  1. 
Then  linv^oo  X,  =  X  if  and  only  if 

lim  Xir  =Xi,     1  <  /  <  n; 

r— >oo 

that  is,  a  sequence  {X,}  of  points  in  R"  converges  to  a  limit  X  if  and  only  if  the  sequences 
of  components  o/{Xr}  converge  to  the  respective  components  o/X. 

Theorem  5.1.15  (Cauchy's  Convergence  Criterion)  A  sequence  {X,}  in 
R"  converges  if  and  only  if  for  each  e  >  0  there  is  an  integer  K  such  that 

|X, -Xs\  <e    if  r,s>K. 

The  next  definition  generalizes  the  definition  of  the  diameter  of  a  circle  or  sphere. 

Definition  5.1.16  If  S  is  a  nonempty  subset  of  R",  then 

d(S)  =  sup  {|X  -  Y|  |  X,  Y  e  5} 
is  the  diameter  of  5.  If  d(S)  <  oo,  5  is  bounded;  if  d(S)  =  oo,  5  is  unbounded. 

Theorem  5.1.17  (Principle  of  Nested  Sets)  IfSi,S2,--  -  are  closed  nonempty 
subsets  o/R"  such  that 

Si  D  S2  D  ■■■  D  Sr  D  ■■■  (5.1.14) 

and 

lim  d(Sr)  =  0,  (5.1.15) 

r—>oo 

then  the  intersection 

oo 
r  =  \ 

contains  exactly  one  point. 
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Proof  Let  {Xr }  be  a  sequence  such  that  Xr  €  Sr  (r  >  1).  Because  of  (5.1.14),  Xr  e  St 
if  r  >  A:,  so 

|X,  -Xs|  <  d(Sk)    if   r,j  >  fc. 

From  (5.1.15)  and  Theorem  5.1.15,  Xr  converges  to  a  limit  X.  Since  X  is  a  limit  point  of 
every  Sk  and  every  Sk  is  closed,  X  is  in  every  Sk  (Corollary  1.3.6).  Therefore,  X  e  I ,  so 
7^0.  Moreover,  X  is  the  only  point  in  I ,  since  if  Y  e  7,  then 

\X-Y\<d(Sk),  k>l, 

and  (5.1.15)  implies  that  Y  =  X.  TJ 

We  can  now  prove  the  Heine-Borel  theorem  for  W.  This  theorem  concerns  compact 
sets.  As  in  K,  a  compact  set  in  W  is  a  closed  and  bounded  set. 

Recall  that  a  collection  3t  of  open  sets  is  an  open  covering  of  a  set  S  if 

S  CU{H\H 

Theorem  5.1.18  (Heine  Borel  Theorem)  If  M.  is  an  open  covering  of  a  com- 
pact subset  S,  then  S  can  be  covered  by  finitely  many  sets  from  X . 

Proof  The  proof  is  by  contradiction.  We  first  consider  the  case  where  n  =  2,  so  that 
you  can  visualize  the  method.  Suppose  that  there  is  a  covering  3t  for  S  from  which  it  is 
impossible  to  select  a  finite  subcovering.  Since  S  is  bounded,  S  is  contained  in  a  closed 
square 

T  =  {(x,  y)\a\  <  x  <  a\  +  L,  a-i  <  x  <  a-i  +  L} 
with  sides  of  length  L  (Figure  5.1.6). 


Figure  5.1.6 
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Bisecting  the  sides  of  T  as  shown  by  the  dashed  lines  in  Figure  5.1.6  leads  to  four  closed 
squares,  T(1\  T{2) ,  T(3\  and  T{4\  with  sides  of  length  L/2.  Let 

S(i)  =  S  n  T{i\    1  <  i  <  4. 

Each  S^'',  being  the  intersection  of  closed  sets,  is  closed,  and 

S=[js(i). 

Moreover,  M  covers  each  S"',  but  at  least  one  cannot  be  covered  by  any  finite  sub- 
collection  of  M ,  since  if  all  the  could  be,  then  so  could  S.  Let  Si  be  a  set  with  this 
property,  chosen  from  S^2\  S^3\  and  S^4\  We  are  now  back  to  the  situation  we 
started  from:  a  compact  set  Si  covered  by  M ,  but  not  by  any  finite  subcollection  of  M . 
However,  Si  is  contained  in  a  square  T\  with  sides  of  length  L/2  instead  of  L.  Bisecting 
the  sides  of  T\  and  repeating  the  argument,  we  obtain  a  subset  S2  of  Si  that  has  the  same 
properties  as  S,  except  that  it  is  contained  in  a  square  with  sides  of  length  L/4.  Continuing 
in  this  way  produces  a  sequence  of  nonempty  closed  sets  So  (=  S),  Si,  Si,  ■  ■  ■ ,  such  that 
S/t  D  S/t+i  and  d(Sk)  <  L/2k~ll2  (k  >  0).  From  Theorem  5.1.17,  there  is  a  point  X  in 
PlfcLi  Sfc.  Since  X  e  S,  there  is  an  open  set  HmJ(  that  contains  X,  and  this  /f  must  also 
contain  some  e-neighborhood  of  X.  Since  every  X  in  Sk  satisfies  the  inequality 

|X-X|  <2~k+1/2L, 

it  follows  that  Sk  C  H  for  k  sufficiently  large.  This  contradicts  our  assumption  on  3i, 
which  led  us  to  believe  that  no  Sk  could  be  covered  by  a  finite  number  of  sets  from  3t. 
Consequently,  this  assumption  must  be  false:  3t  must  have  a  finite  subcollection  that  covers 
S.  This  completes  the  proof  for  n  =  2. 

The  idea  of  the  proof  is  the  same  for  n  >  2.  The  counterpart  of  the  square  T  is  the 
hypercube  with  sides  of  length  L: 

T  =  {(xi,  X2, . . . ,  xn)  J  cii  <  Xi  <cij  +  L,i  =  1, 2  n}  . 

Halving  the  intervals  of  variation  of  the  n  coordinates  x\,  X2,  •  •  • ,  xn  divides  T  into  2" 
closed  hypercubes  with  sides  of  length  L/2: 

T(i)  =  {(xi,  x2,  ...,Xn)\bi  <  Xi  <  bi  +  L/2, 1  <  i  <  n)  , 

where  bi  =  cii  or  bi  =  cii  +  L/2.  If  no  finite  subcollection  of  3t  covers  S,  then  at  least 
one  of  these  smaller  hypercubes  must  contain  a  subset  of  S  that  is  not  covered  by  any  finite 
subcollection  of  S.  Now  the  proof  proceeds  as  for  n  =  2.  H 

The  Bolzano-Weierstrass  theorem  is  valid  in  W ;  its  proof  is  the  same  as  in  R. 
Connected  Sets  and  Regions 

Although  it  is  legitimate  to  consider  functions  defined  on  arbitrary  domains,  we  restricted 
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our  study  of  functions  of  one  variable  mainly  to  functions  defined  on  intervals.  There  are 
good  reasons  for  this.  If  we  wish  to  raise  questions  of  continuity  and  differentiability  at 
every  point  of  the  domain  D  of  a  function  /,  then  every  point  of  D  must  be  a  limit  point 
of  D°.  Intervals  have  this  property.  Moreover,  the  definition  of  f  f{x)  dx  is  obviously 
applicable  only  if  /  is  defined  on  [a,b]. 

It  is  not  productive  to  consider  questions  of  continuity  and  differentiability  of  functions 
defined  on  the  union  of  disjoint  intervals,  since  many  important  results  simply  do  not  hold 
for  such  domains.  For  example,  the  intermediate  value  theorem  (Theorem  2.2.10;  see  also 
Exercise  2.2.25)  says  that  if  /  is  continuous  on  an  interval  /  and  f{x\)  <  [i  <  f(x2) 
for  some  X\  and  x-i  in  /,  then  f(x)  =  ji  for  some  X  in  /.  Theorem  2.3.12  says  that  /  is 
constant  on  an  interval  I  if  /'  =  0  on  / .  Neither  of  these  results  holds  if  /  is  the  union  of 
disjoint  intervals  rather  than  a  single  interval;  thus,  if  /  is  defined  on  /  =  (0,  1)  U  (2,  3) 


then  /  is  continuous  on  /,  but  does  not  assume  any  value  between  0  and  1,  and  /'  =  0  on 
/,  but  /  is  not  constant. 

It  is  not  difficult  to  see  why  these  results  fail  to  hold  for  this  function:  the  domain  of  / 
consists  of  two  disconnected  pieces.  It  would  be  more  sensible  to  regard  /  as  two  entirely 
different  functions,  one  defined  on  (0,  1)  and  the  other  on  (2.  3).  The  two  results  mentioned 
are  valid  for  each  of  these  functions. 

As  we  will  see  when  we  study  functions  defined  on  subsets  of  W,  considerations  like 
those  just  cited  as  making  it  natural  to  consider  functions  defined  on  intervals  in  E  lead 
us  to  single  out  a  preferred  class  of  subsets  as  domains  of  functions  of  n  variables.  These 
subsets  are  called  regions.  To  define  this  term,  we  first  need  the  following  definition. 

Definition  5.1.19  A  subset  S  of  E"  is  connected  if  it  is  impossible  to  represent  S  as 
the  union  of  two  disjoint  nonempty  sets  such  that  neither  contains  a  limit  point  of  the  other; 
that  is,  if  S  cannot  be  expressed  as  S  =  A  U  B,  where 


If  S  can  be  expressed  in  this  way,  then  5  is  disconnected. 

Example  5.1.8  The  empty  set  and  singleton  sets  are  connected,  because  they  cannot 
be  represented  as  the  union  of  two  disjoint  nonempty  sets. 

Example  5.1.9  The  space  R"  is  connected,  because  if  W  =  A  U  B  with  1  n  B  =  0 
and  A  n  B  =  0,  then  A  C  A  and  B  C  B;  that  is,  A  and  B  are  both  closed  and  therefore 
are  both  open.  Since  the  only  nonempty  subset  of  W  that  is  both  open  and  closed  is  E" 
itself  (Exercise  5.1.21),  one  of  A  and  B  is  W  and  the  other  is  empty. 


by 


/(*)  = 


1,  0  <  x  <  1. 
0,    2  <  x  <  3, 


A  ^  0,     5^0.     A  n  B  =  0,     and    AnB  =  0. 


(5.1.16) 
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(1,2) 

(3,3) 
(3,2) 

(l,  i) 

Figure  5.1.7 

If  Xi ,  X2, . . . ,  X/t  are  points  in  M"  and  L,  is  the  line  segment  from  X,  to  X;+i,  1  <  i  < 
k  —  1,  we  say  that  L\,  L2,  L^-i  form  a  polygonal  path  from  Xi  to  X/t,  and  that  Xi 
and  Xfe  are  connected  by  the  polygonal  path.  For  example,  Figure  5.1.7  shows  a  polygonal 
path  in  K2  connecting  (0.  0)  to  (3,  3).  A  set  5  is  polygonally  connected  if  every  pair  of 
points  in  S  can  be  connected  by  a  polygonal  path  lying  entirely  in  S . 

Theorem  5.1.20  An  open  set  S  in  M"  is  connected  if  and  only  if  it  is  polygonally 
connected. 

Proof  For  sufficiency,  we  will  show  that  if  S  is  disconnected,  then  S  is  not  polygonally 
connected.  Let  S  =  A  U  B,  where  A  and  B  satisfy  (5.1.16).  Suppose  that  Xi  e  A  and 
X2  e  B,  and  assume  that  there  is  a  polygonal  path  in  S  connecting  Xi  to  X2.  Then  some 
line  segment  L  in  this  path  must  contain  a  point  Yi  in  A  and  a  point  Y2  in  B.  The  line 
segment 

X  =  t\2  +  (1  -f)Yi,  0<f<l, 
is  part  of  L  and  therefore  in  5.  Now  define 

p  =  sup  {r  I  tY2  +  (1  -  r)Yx  ei,0<(<i<l}, 

and  let 

X„  =  pY2  +  (l-p)Yi. 

ThenXp  €  AC\B.  However,  since  Xp  &  A\JB  and  A n B  =  AC\B  =  0,  this  is  impossible. 
Therefore,  the  assumption  that  there  is  a  polygonal  path  in  S  from  Xi  to  X2  must  be  false. 


Section  5.1  Structure  ofW  297 


For  necessity,  suppose  that  S  is  a  connected  open  set  and  Xo  €  S.  Let  A  be  the  set 
consisting  of  Xo  and  the  points  in  S  can  be  connected  to  Xo  by  polygonal  paths  in  S.  Let 
B  be  set  of  points  in  S  that  cannot  be  connected  to  Xo  by  polygonal  paths.  If  Yo  €  S,  then 
S  contains  an  e-neighborhood  jVe(Yo)  of  Yo,  since  S  is  open.  Any  point  Yi  in  N€(Yo  can 
be  connected  to  Yo  by  the  line  segment 

X  =  tYi  +  (1  -0Y0,  0<r<l, 

which  lies  in  Ne(Yo)  (Lemma  5.1.12)  and  therefore  in  S.  This  implies  that  Yo  can  be 
connected  to  Xo  by  a  polygonal  path  in  5  if  and  only  if  every  member  of  jVe(Yo)  can  also. 
Thus,  Ne(Y0)  C  A  if  Y0  e  A,  and  Ne(Y0)  e  B  if  Y0  <=  B.  Therefore,  A  and  B  are  open. 
Since  A  n  B  =  0,  this  implies  that  A  n  ~6  =  1  n  B  =  0  (Exercise  5.1.14).  Since  A  is 
nonempty  (Xo  e  A),  it  now  follows  that  B  =  0,  since  if  B  ^  0,  5  would  be  disconnected 
(Definition  5.1.19).  Therefore,  ^4  =  S,  which  completes  the  proof  of  necessity.  H 

We  did  not  use  the  assumption  that  S  is  open  in  the  proof  of  sufficiency.  In  fact,  we  actu- 
ally proved  that  any  polygonally  connected  set,  open  or  not,  is  connected.  The  converse  is 
false.  A  set  (not  open)  may  be  connected  but  not  polygonally  connected  (Exercise  5.1.29). 

Our  study  of  functions  on  W  will  deal  mostly  with  functions  whose  domains  are  regions, 
defined  next. 

Definition  5.1.21  A  region  S  in  W  is  the  union  of  an  open  connected  set  with  some, 
all,  or  none  of  its  boundary;  thus,  S°  is  connected,  and  every  point  of  S  is  a  limit  point  of 
S°. 

Example  5.1.10  Intervals  are  the  only  regions  in  M  (Exercise  5.1.31).  The  «-ball 
Br(Xo)  (Example  5.1.7)  is  a  region  in  W,  as  is  its  closure  Sr(Xo).  The  set 

S  =  {(x,y)\x2  +  y2  <  1    or    x2  +  y2  >  4} 

(Figure  5.1.8(a),  page  298)  is  not  a  region  in  W2,  since  it  is  not  connected.  The  set  S\ 
obtained  by  adding  the  line  segment 

Lx:    X  =  t(0, 2)  +  (1-0(0,1),    0  <  r  <  1, 

to  S  (Figure  5.1.8  (b))  is  connected  but  is  not  a  region,  since  points  on  the  line  segment  are 
not  limit  points  of  S°.  The  set  S2  obtained  by  adding  to  Si  the  points  in  the  first  quadrant 
bounded  by  the  circles  x2  +  y2  =  1  and  x2  +  y2  =  4  and  the  line  segments  L\  and 

L2:    X  =  /(2,0)  +  (1-0(1,0),    0  <  f  <  1 

(Figure  5.1.8(c)),  is  a  region. 

More  about  Sequences  in  W1 

From  Definition  5.1.13,  a  sequence  {Xr}  of  points  in  W  converges  to  a  limit  X  if  and  only 
if  for  every  e  >  0  there  is  an  integer  K  such  that 

|Xr-X|<€    if    r  >  K. 
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The  R"  definitions  of  divergence,  boundedness,  subsequence,  and  sums,  differences,  and 
constant  multiples  of  sequences  are  analogous  to  those  given  in  Sections  4.1  and  4.2  for 
the  case  where  n  =  1.  Since  R"  is  not  ordered  for  n  >  1,  monotonicity,  limits  inferior  and 
superior  of  sequences  inR",  and  divergence  to  ±oo  are  undefined  for«  >  1.  Products  and 
quotients  of  members  of  R"  are  also  undefined  if  n  >  1 . 


—  *~x 


(c) 

Figure  5.1.8 

Several  theorems  from  Sections  4. 1  and  4.2  remain  valid  for  sequences  in  R" ,  with  proofs 
unchanged,  provided  that  "|  |"  is  interpreted  as  distance  in  R".  (A  trivial  change  is  re- 
quired: the  subscript  n,  used  in  Sections  4.1  and  4.2  to  identify  the  terms  of  the  sequence, 
must  be  replaced,  since  n  here  stands  for  the  dimension  of  the  space.)  These  include  The- 
orems 4.1.2  (uniqueness  of  the  limit),  4.1.4  (boundedness  of  a  convergent  sequence),  parts 
of  4.1.8  (concerning  limits  of  sums,  differences,  and  constant  multiples  of  convergent  se- 
quences), and  4.2.2  (every  subsequence  of  a  convergent  sequence  converges  to  the  limit  of 
the  sequence). 
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5.1  Exercises 


With  R  replaced  by  W ,  the  following  exercises  from  Section  1.3  are  also  suitable  for  this 
section:  1.3.7-1.3.10,  1.3.12-1.3.15,  1.3.19,  1.3.20  {except  (e)),  and  1.3.21. 

1.  FindaX+fcY. 

(a)  X=  (1,2,-3, 1),    Y=  (0,-1, 2,0),  a  =  3,  b  =  6 

(b)  X=  (1,-1,2),    Y  =  (0,  —1, 3),  a  =  —1,  b  =  2 

(c)  X=(|,|,ii),    Y=(-i,l,5,i),a  =  i,6  =  I 

2 .  Prove  Theorem  5 . 1 . 2 . 

3.  Find|X|. 

(a)  (1,2,-3,1)  (b) 

(c)  (1,2,-1,  3, 4)  (d)  (0, 1, 0,  -1, 0,  -1) 

4.  Find|X-Y|. 

(a)  X  =  (3,4,5,-4),    Y=  (2, 0,-1, 2) 

(b)  X  =  (-I,I  I,  -I),    Y  =  (!,-!,  I  -I) 

(c)  X=  (0,0,0),    Y=  (2,-1,2) 

(d)  X  =  (3,-1,4,0,-1),    Y=  (2,0, 1,-4, 1) 

5.  FindX-Y. 

(a)  X  =  (3, 4,  5,  -4),    Y  =  (3, 0,  3,  3) 

(b)  X  =  (g,  j^,  |,  |),    Y  =  (-i,  i,  i,  -i) 

(c)  X  =  (1,2,-3,  1,4),    Y=  (1,2,-1,3,4) 

6 .  Prove  Theorem  5.1.9. 

7.  Find  a  parametric  equation  of  the  line  through  Xo  in  the  direction  of  U. 

(a)  X0  =  (1,2,-3, 1),    U=  (3,4,5,-4) 

(b)  X0  =  (2,0,-1,2,4),    U  =  (-1,0,  1,3,2) 

(c)  X0  =  (-i,i,i,-i),    U  =(!,-!,  I, -I) 

8.  Suppose  that        0  and  V  ^  0.  Complete  the  sentence:  The  equations 

X  =  X0  +  fU.    -oo  <  t  <  oo, 

and 

X  =  Xi  +  sV,    — oo  <  s  <  oo, 
represent  the  same  line  in  R"  if  and  only  if ... 

9.  Find  the  equation  of  the  line  segment  from  Xo  to  Xi . 

(a)  X0  =  (1,-3,4,2),    Xi  =  (2,0,-1,5) 

(b)  X0  =  (3, 1-2,1,4),    Xi  =  (2,0,-1,4,-3) 

(c)  X0  =  (1,2,-1),    X1  =  (0,-1,-1) 
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10.  Find  sup  {e  |  Ne(X0)  C  S}. 

(a)  X0  =  (1,2, -1,3);  5  =  the  open  4-ball  of  radius  7  about  (0,  3,  -2,  2) 

(b)  X0  =  (1,2,-1,3);  S  =  {(xu  x2,  x3,  x4)  |  |jc,-|  <  5, 1  <  /  <  4} 

(c)  X0  =  (3,  §);  S  =  the  closed  triangle  with  vertices  (2,  0),  (2,  2),  and  (4,  4) 

11.  Find  (i)  3S;  (ii)  S;  (iii)  5°;  (iv)  exterior  of  S. 

(a)  S  =  {(xi,x2,  x3,  X4)  I  \xi  I  <  3,  i  =  1,2,  3} 

(b)  5  =  \(x,y,  1)  I  x2  +  y2  <  1} 

12.  Describe  the  following  sets  as  open,  closed,  or  neither. 

(a)  5  =  {(xi,x2,x3,x4)  I  |*i I  >  0, x2  <  1,*3  ^  -2} 

(b)  5  =  {(xi,x2,x3,x4)  \xi  =  l,x3  7^  -4} 

(c)  5  =  {(xi,x2,  x3,  X4)  J  x\  =  1,  —  3  <  x2  <  1,  X4  =  —5} 

13.  Show  that  the  closure  of  the  open  n-ball 

Br{X0)  =  {X||X-X0|  <r} 

is  the  closed  «-ball 

5r(X0)=  {X||X-X0|  <r}. 

14.  Prove:  If  A  and  B  are  open  and  A  n  5  =  0,  then  4nfi=InB  =  0. 

15.  Show  that  if  limr^oo  Xr  exists,  then  it  is  unique. 

16.  Prove  Theorem  5.1.14. 

17.  Prove  Theorem  5.1.15. 

18.  Find  limr-xx,  Xr. 

(a)    Xr  =  (r  sin  — ,  cos  — ,  e_r) 

<»*-(4'4^K)') 

19.  FindJ(S). 

(a)    5=  {(Jc,y,jc)||jc|<2,|y[<l,|z-2|<2} 

(c)  S  =  the  triangle  in  R2  with  vertices  (2,  0),  (2,  2),  and  (4,  4) 

(d)  S  =  {(xi,x2,  ...,x„)  \  \xt  \  <  L,  i  =  1,2, . . .,«} 

(e)  S  =  {(jc,j,z)|x/0,|j|  <  l,z  >2} 

20.  Prove  that  c?(5)  =  d(S)  for  any  set  S  in  R" . 

21.  Prove:  If  a  nonempty  subset  S  of  R"  is  both  open  and  closed,  then  S  =  R" . 
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Use  the  Bolzano-Weierstrass  theorem  to  show  that  if  Si,  S2,  ■  Sm,  ...  is  an 
infinite  sequence  of  nonempty  compact  sets  and  Si  D  S2  D  •  ■  •  D  Sm  D  •  •  • ,  then 
Hot=i  Sm  is  nonempty.  Show  that  the  conclusion  does  not  follow  if  the  sets  are 
assumed  to  be  closed  rather  than  compact. 

Suppose  that  a  sequence  U\,  TJ%,  ...  of  open  sets  covers  a  compact  set  S.  Without 
using  the  Heine-Borel  theorem,  show  that  S  C  Um=i  Um  f°r  some  N .  Hint: 
Apply  Exercise  5.1.22  to  the  sets  S„  =  S  fl  (Um=i  ^mf  • 

(This  is  a  seemingly  restricted  version  of  the  Heine-Borel  theorem,  valid  for  the 
case  where  the  covering  collection  3t  is  denumerable.  However,  it  can  be  shown 
that  there  is  no  loss  of  generality  in  assuming  this.) 

The  distance  from  a  point  Xo  to  a  nonempty  set  S  is  defined  by 
dist(X0,S)  =  inf{|X-X0|  |  X  €  S}  . 

(a)  Prove:  If  S  is  closed  and  Xo  el",  there  is  a  point  X  in  S  such  that 

|X-X0|  =dist(X0,S). 

Hint:  Apply  Exercise  5.1.22  to  the  sets 

Cm  =  {X|X  e  S  and\X-X0  \  <  dist(X0.  S)  +  l/m)  ,    m  >  1. 

(b)  Show  that  if  S  is  closed  and  X0  ^  S,  then  dist(X0,  S)  >  0. 

(c)  Show  that  the  conclusions  of  (a)  and  (b)  may  fail  to  hold  if  S  is  not  closed. 
25.     The  distance  between  two  nonempty  sets  S  and  T  is  defined  by 

dist(S,  T)  =  inf  {|X  -  Y|  |  X  e  S,  Y  e  T)  . 

Prove:  If  S  is  closed  and  T  is  compact,  there  are  points  X  in  S  and  Y  in  T 
such  that 

|X-Y|  =  dist(S,  T). 
Hint:  Use  Exercises  5.1.22  and  5.1.24. 

Under  the  assumptions  of  (a),  show  that  dist(S,  7")  >  0  if  S  fl  7  =  0. 
Show  that  the  conclusions  of  (a)  and  (b)  may  fail  to  hold  if  S  or  T  is  not 
closed  or  T  is  unbounded. 

Prove:  If  a  compact  set  S  is  contained  in  an  open  set  U,  there  is  a  positive 
number  r  such  that  the  set 

Sr  =  {X|  dist(X,  S)  <  r] 

is  contained  in  U .  (You  will  need  Exercise  5.1.24  here.) 
(b)    Show  that  Sr  is  compact. 


22. 


23. 


24. 


(a) 

(b) 
(c) 

26.  (a) 
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27.  Let  D\  and  D2  be  compact  subsets  of  R".  Show  that 

D  =  {(X,Y)  |X  e  Di,Ye  D2} 

is  a  compact  subset  of  R2" . 

28.  Prove:  If  S  is  open  and  S  =  A  U  B  where  InB=4nB  =  0,  then  A  and  B  are 
open. 

29.  Give  an  example  of  a  connected  set  in  W  that  is  not  polygonally  connected. 

30.  Prove  that  a  region  is  connected. 

31.  Show  that  the  intervals  are  the  only  regions  in  R. 

32.  Prove:  A  bounded  sequence  in  W  has  a  convergent  subsequence.  Hint:  Use  Theo- 
rems 5.1.14,  4.2.2,  and 4.2.5(a). 

33.  Define  "limr^oo  Xr  =  00"  if  {Xr}  is  a  sequence  inR",  n  >  2. 

5.2  CONTINUOUS  REAL- VALUED  FUNCTIONS  OF  n  VARI- 
ABLES 

We  now  study  real- valued  functions  of  n  variables.  We  denote  the  domain  of  a  function  / 
by  D f  and  the  value  of  /  at  a  point  X  =  (x\ ,  Xi, . . . ,  x„)  by  f  (X)  or  / (xi ,  Xi, . . . ,  x„). 
We  continue  the  convention  adopted  in  Section  2.1  for  functions  of  one  variable:  If  a  func- 
tion is  defined  by  a  formula  such  as 

f(X)  =  (l-x21-xl  xlf2  (5.2.1) 

or 

g{X)  =  (I  -  x\  -  x22  x2n)~l  (5.2.2) 

without  specification  of  its  domain,  it  is  to  be  understood  that  its  domain  is  the  largest 
subset  of  W  for  which  the  formula  defines  a  unique  real  number.  Thus,  in  the  absence  of 
any  other  stipulation,  the  domain  of  /  in  (5.2.1)  is  the  closed  «-ball  {X  |  |X|  <  l},  while 
the  domain  of  g  in  (5.2.2)  is  the  set  {X  |  |X|  ^  l}. 

The  main  objective  of  this  section  is  to  study  limits  and  continuity  of  functions  of  n 
variables.  The  proofs  of  many  of  the  theorems  here  are  similar  to  the  proofs  of  their  coun- 
terparts in  Sections  2.1  and  .  We  leave  most  of  them  to  you. 

Definition  5.2.1  We  say  that  /(X)  approaches  the  limit  L  as  X  approaches  Xo  and 
write 

lim  f(X)  =  L 

X^Xo 

if  Xo  is  a  limit  point  of  D  f  and,  for  every  e  >  0,  there  is  a  S  >  0  such  that 

|/(X)-L|<6 


for  all  X  in  D  /  such  that 

0  <  |X-X0|  <  8. 
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Example  5.2.1  If 

g(x,y)  =  \-x2-2y2, 


then 


lim  g(x,y)=  l-x20-2y2  (5.2.3) 
(x,y)-+(xo,yo) 


(5.2.4) 


for  every  (xo,  yo).  To  see  this,  we  write 

\g(x,  y)-(l-  x2  -  2y2)\  =  \(l-x2-  2y2)  -  (1  -  x2  -  2y2)\ 

<  \x2-x2\  +  2\y2-y2\ 
=  \(x  +  x0)(x  -  x0)\  +  2\(y  +  y0)(y  -  y0)\ 

<  |X-X0|(|x  +  x0|  +  2|>>  +  j0)|), 

since 

|jc-x0|<|X-Xo|    and     \y  -  y0\  <  |X  -  X0|. 
If  |X  — X0|  <  l.then  |x|  <  |*0|  +  1  and  |y|  <  |y0|  +  1.  This  and  (5.2.4)  imply  that 

\g(x,y)-(l-x2-2y2)\  <  K\X-X0\    if  |X-X0|<1, 

where 

K=(2\x0\  +  l)  +  2(2\y0\  +  l), 

Therefore,  if  e  >  0  and 

|X-X0|  <  8  =  min{l,e/^}, 

then 

\g(x,y)-(l-x2-2y2)\  <  e. 
This  proves  (5.2.3).  ■ 

Definition  5.2.1  does  not  require  that  /  be  defined  at  Xo,  or  even  on  a  deleted  neighbor- 
hood of  Xo. 

Example  5.2.2  The  function 

h(x,  y)  = 


sin  -y/ 1  —  x2  —  2y2 


jl-x2-2y2 

is  defined  only  on  the  interior  of  the  region  bounded  by  the  ellipse 

x2  +  2y2  =  1 

(Figure  5.2.1  (a),  page  304).  It  is  not  defined  at  any  point  of  the  ellipse  itself  or  on  any 
deleted  neighborhood  of  such  a  point.  Nevertheless, 

lim       h{x,y)  =  \  (5.2.5) 
(x,y)-*(xo,yo) 
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if 

To  see  this,  let 
Then 

Recall  that 


x20+2y20  =  L 

u(x,  y)  =  \J\  —x2  —  2y2. 

sinw(x,  y) 


(5.2.6) 


h(x,y)  = 


u(x,y) 


(5.2.7) 


sinr 
lim  =  1: 


r 

therefore,  if  e  >  0,  there  is  a  Si  >  0  such  that 


siriw 


1 


<  e    if   0  <  \u\  <  8\. 


(5.2.8) 


From  (5.2.3), 


lim       (1  -  x2  -  2y2)  =  0 


if  (5.2.6)  holds,  so  there  is  a  8  >  0  such  that 

0  <  u2(x,  y)  =  (1  -  x2  -  2y2)  <  82 

if  X  =  (x,  y)  is  in  the  interior  of  the  ellipse  and  |X  —  Xo|  <  8;  that  is,  if  X  is  in  the  shaded 
region  of  Figure  5.2.1  (b). 

Therefore,   

0  <  u  =  ^/l-x2-2y2  <  81  (5.2.9) 

if  X  is  in  the  interior  of  the  ellipse  and  |X  —  Xo  |  <  8;  that  is,  if  X  is  in  the  shaded  region  of 
Figure  5.2.1  (b).  This,  (5.2.7),  and  (5.2.8)  imply  that 

\h(x,y)-l\<€ 

for  such  X,  which  implies  (5.2.5). 

y 


i)  i  ix-xj  =  5 


/ 

/ 

"  \ 

\ 

\ 

— t  



\ 

/ 

\ 

— 

~~      x2+2y2  =  l 

(a) 


(b) 


Figure  5.2.1 
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The  following  theorem  is  analogous  to  Theorem  2.1.3.  We  leave  its  proof  to  you  (Exer- 
cise 5.2.2). 

Theorem  5.2.2  //limx-^xo  /(X)  exists,  then  it  is  unique. 

When  investigating  whether  a  function  has  a  limit  at  a  point  Xo,  no  restriction  can  be 
made  on  the  way  in  which  X  approaches  Xo,  except  that  X  must  be  in  D  r ,  The  next 
example  shows  that  incorrect  restrictions  can  lead  to  incorrect  conclusions. 

Example  5.2.3  The  function 

xy 


f(x,y)  = 


x2  +  y2 


is  defined  everywhere  in  M2  except  at  (0,  0).  Does  lim^^-^co)  f(x,  y)  exist?  If  we  try 
to  answer  this  question  by  letting  (x,  y)  approach  (0,  0)  along  the  line  y  =  x,  we  see  the 
functional  values 

x2  1 

/(*•*)  =  2^  =  2 

and  conclude  that  the  limit  is  1/2.  However,  if  we  let  (x,  y)  approach  (0,  0)  along  the  line 
y  =  —x,  we  see  the  functional  values 

x2  1 
/C».-*)  =  "2i»=-2 

and  conclude  that  the  limit  equals  —1/2.  From  Theorem  5.2.2,  these  two  conclusions 
cannot  both  be  correct.  In  fact,  they  are  both  incorrect.  What  we  have  shown  is  that 

\imf(x,x)=—    and     \imf(x,—x)  = — . 
x^o  2  x->o  2 

Since  limx_>o  f(x,x)  and  \imx-*o  / (jc,  —  x)  must  both  equal  lim^-^-^o)  f(x,y)  if  the 
latter  exists  (Exercise  5.2.3  (a)),  we  conclude  that  the  latter  does  not  exist.  ■ 

The  sum,  difference,  and  product  of  functions  of  n  variables  are  defined  in  the  same 
way  as  they  are  for  functions  of  one  variable  (Definition  2.1.1),  and  the  proof  of  the  next 
theorem  is  the  same  as  the  proof  of  Theorem  2.1.4. 

Theorem  5.2.3  Suppose  that  f  and  g  are  defined  on  a  set  D,  Xo  is  a  limit  point  of 
D,  and 


Then 


lim  /(X)  =  Li,      lim  g(X)  =  L2. 

X— >Xo  X— *-Xq 


lim  (f  +  g)(X)  =  Li  +  L2,  (5.2.10) 

X->-Xo 

lim  (f-g)(X)  =  L1-L2,  (5.2.11) 

x->-x0 

lim  (fg)(X)  =  LXL2,  (5.2.12) 
x^x0 


and,  if  L2  ^  0, 


Urn  (1) 

x->xt)  \gj 


(X)  =  -i.  (5.2.13) 


L 


2 


306    Chapter  5  Real-Valued  Functions  of  Several  Variables 
Infinite  Limits  and  Limits  as  |X|->oo 

Definition  5.2.4  We  say  that  / (X)  approaches  oo  as  X  approaches  Xo  and  write 

lim  /(X)  =  oo 

if  Xo  is  a  limit  point  of  D  f  and,  for  every  real  number  M,  there  is  a  8  >  0  such  that 
/(X)  >  M    whenever    0  <  |X  -  X0|  <  8    and    X  e  Df. 

We  say  that 

lim  /(X)  =  -oo 

X^Xq 


if 


lim  (-/)(X)  =  oo. 

X^Xq 


Example  5.2.4  If 

f(x)  =  (i-x21-xj  xir1'2, 

then 

xlim  /(X)  =  oo 

if  |Xo|  =  1,  because 

/(X)  =  ' 


|X-X0|' 
so 

f(X)>M    if    0  <  |X-X0|  <  8  =  — . 

M 

Example  5.2.5  If 

f(x,y)  = 


x  +  2y  +  1 ' 

then  lim(Xi3,)^(!  _x)  f(x,  y)  does  not  exist  (why  not?),  but 

lim  |/(x,jO|=oo. 
(jc,y)-»-(l,-l) 

To  see  this,  we  observe  that 

|x  +  2y+l|  =  |(x-l)  +  2(y  +  l)| 

<  \/5|X  —  Xo |    (by  Schwarz's  inequality), 

where  Xo  =  (1,-1),  so 

\f(x,y)\  =   l-  >  ' 


\x  +  2y+l\  V5IX-X0I 
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Therefore, 

\f(x,y)\  >  M    if    0<  |X-Xo|  <  ' 


MV5' 

Example  5.2.6  The  function 


f(x,y,z) 


x2  +  y2  +  z2 


x2  +  y2+z2 


assumes  arbitrarily  large  values  in  every  neighborhood  of  (0,  0,  0).  For  example,  if  X^  = 
(xk,yk,z,k),  where 

1 

xk=yk=zk= 

then 

f(Xk)  =  (k  +  \)  *■ 

However,  this  does  not  imply  that  limx^o  / (X)  =  oo,  since,  for  example,  every  neighbor- 
hood of  (0,  0,  0)  also  contains  points 

Xk_f     1  1  1  \ 

\\/3kjt  *j3kn  V3Icjt J 

for  which  /(Xl)  =  0. 

Definition  5.2.5  If  Df  is  unbounded,  we  say  that 

lim   f(X)  =  L  (finite) 

if  for  every  e  >  0,  there  is  a  number  R  such  that 

|/(X)  -L\<€    whenever     |X|  >  R    and    X  e  Df. 

Example  5.2.7  If 

f(x,y,z)  =  cos(x2  +  2ly2+z2y 

then 

lim  /(X)  =  l.  (5.2.14) 

|X|^oo 

To  see  this,  we  recall  that  the  continuity  of  cos  u  at  u  =  0  implies  that  for  each  e  >  0  there 
is  a  8  >  0  such  that 

I  cosw  —  II  <  e    if    \u\  <  8. 
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Since 

1  1 
x2  +  2y2  +  z2  ~  |XP' 
it  follows  that  if  |X|  >  l/VE,  then 

1  <*. 


x2  +  2y2  +  z2 
Therefore, 

|/(X)-1|<6. 

This  proves  (5.2.14). 

Example  5.2.8  Consider  the  function  defined  only  on  the  domain 
D  =  {(x,  y)  |  0  <  y  <  ax} ,    0  <  a  <  1 

(Figure  5.2.2),  by 

f{x,y)=  ' 


x-y 
We  will  show  that 

lim  f(x,y)  =  0.  (5.2.15) 

|X|-MX> 

It  is  important  to  keep  in  mind  that  we  need  only  consider  (x,  y)  in  D,  since  /  is  not 
defined  elsewhere. 

InD, 

x-y  >x(\-a)  (5.2.16) 

and 

|X|2  =  x2  +  y2  <x2(l  +  a2), 

so 

|X| 

x  > 


This  and  (5.2.16)  imply  that 

x-y>-^L=\X\,  Xefl, 
VI  +  a2 

so 

VI  +a2  1 
1  —  a  |X| 

Therefore, 

\f(x.y)\<€ 

ifXeDand 

IX,  >  4±?I. 

I  —  a  e 

This  implies  (5.2.15). 
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y 


Figure  5.2.2 

We  leave  it  to  you  to  define  limix|->-oo  /(X)  =  oo  and  lim|x|-Mx>  /(X)  =  — oo  (Exer- 
cise 5.2.6). 

We  will  continue  the  convention  adopted  in  Section  2.1:  "linix^x0  / (X)  exists"  means 
that  linix^xo  / (X)  =  L,  where  L  is  finite;  to  leave  open  the  possibility  that  L  =  ±oo,  we 
will  say  that  "linix^x0  / (X)  exists  in  the  extended  reals."  A  similar  convention  applies  to 
limits  as  |X|  — >  oo. 

Theorem  5.2.3  remains  valid  if  "limx^x0"  is  replaced  by  "lim|x|->oo>"  provided  that 
D  is  unbounded.  Moreover,  (5.2.10),  (5.2.11),  and  (5.2.12)  are  valid  in  either  version  of 
Theorem  5.2.3  if  either  or  both  of  L\  and  is  infinite,  provided  that  their  right  sides  are 
not  indeterminate,  and  (5.2.13)  remains  valid  if  Li  ^  0  and  L\j L-i  is  not  indeterminate. 

Continuity 

We  now  define  continuity  for  functions  of  n  variables.  The  definition  is  quite  similar  to  the 
definition  for  functions  of  one  variable. 

Definition  5.2.6  If  Xo  is  in  D f  and  is  a  limit  point  of  D /,  then  we  say  that  /  is 
continuous  at  Xo  if 

lim  /(X)  =  /(Xo).  ■ 

X->Xo 

The  next  theorem  follows  from  this  and  Definition  5.2.1. 
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Theorem  5.2.7  Suppose  that  Xo  is  in  D  f  and  is  a  limit  point  of  D  y.  Then  f  is  con- 
tinuous at  Xo  if  and  only  if  for  each  e  >  0  there  is  a  8  >  0  such  that 

|/(X)  - /(X0)|  <  6 

whenever 

|X-X0|<<5    and  XeDf. 

In  applying  this  theorem  when  Xo  £  D^,  we  will  usually  omit  "and  X  e  D /,"  it  being 
understood  that  S^(Xo)  C  D /. 

We  will  say  that  /  is  continuous  on  S  if  /  is  continuous  at  every  point  of  S. 

Example  5.2.9  From  Example  5.2.1,  we  now  see  that  the  function 

f(x,y)=  l-x2-2y2 


Example  5.2.10  If  we  extend  the  definition  of  h  in  Example  5.2.2  so  that 

n  yfl  -x2-2y7 


■2 


i  .  xz  +  2yz<l, 

h(x,y)=\  yfi-x2-2y2 

\,  x2  +  2y2  =  l, 

then  it  follows  from  Example  5.2.2  that  h  is  continuous  on  the  ellipse 

x2  +  2y2  =  1. 

We  will  see  in  Example  5.2.13  that  h  is  also  continuous  on  the  interior  of  the  ellipse. 
Example  5.2.11  It  is  impossible  to  define  the  function 

rt   \  *y 

f(x,y)  = 


x2  +  y2 

at  the  origin  to  make  it  continuous  there,  since  we  saw  in  Example  5.2.3  that 


Jim  f(x,y) 

(x,j)^(0,0) 


does  not  exist. 


Theorem  5.2.3  implies  the  next  theorem,  which  is  analogous  to  Theorem  2.2.5  and,  like 
the  latter,  permits  us  to  investigate  continuity  of  a  given  function  by  regarding  the  function 
as  the  result  of  addition,  subtraction,  multiplication,  and  division  of  simpler  functions. 
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Theorem  5.2.8  // /  and  g  are  continuous  on  a  set  S  in  W ,  then  so  are  f  +  g,  f  —  g, 
and  fg.  Also,  f/g  is  continuous  at  each  Xo  in  S  such  that  g(Xo)  ^  0. 

Vector- Valued  Functions  and  Composite  Functions 

Suppose  that  g\,  g2,  gn  are  real-valued  functions  defined  on  a  subset  T  of  W" ,  and 
define  the  vector-valued  function  G  on  T  by 

G(U)  =  (gl(U),  g2(U), gn(V)) ,    U  6  T. 

Then  gi,  gi, . . . ,  gn  are  the  component  functions  of  G  =  (gi,  g2,  ■  ■  ■ ,  gn)-  We  say  that 

lim  G(U)  =  L  =  (LUL2, . . . ,  Ln) 
u^u0 

if 

lim  £,(U)  =  Li,     1  <  (  <  7i, 

and  that  G  is  continuous  at  Uo  if  g\,  gi,       gn  are  each  continuous  at  Uo. 

The  next  theorem  follows  from  Theorem  5.1.14  and  Definitions  5.2.1  and  5.2.6.  We  omit 
the  proof. 

Theorem  5.2.9  For  a  vector-valued  function  G, 

lim  G(U)  =  L 

u->u0 

if  and  only  if  for  each  e  >  0  there  is  a  8  >  0  such  that 

|G(U)  -  L|  <  €    whenever    0  <  |U  -  U0|  <  S    and    U  €  DG. 
Similarly,  G  is  continuous  at  Uo  if  and  only  if  for  each  e  >  0  there  is  a  8  >  0  such  that 

|G(U)-G(U0)|  <€    whenever    |U-U0|<<5    and    U  e  Dq. 

The  following  theorem  on  the  continuity  of  a  composite  function  is  analogous  to  Theo- 
rem 2.2.7. 

Theorem  5.2.10  Let  f  be  a  real-valued  function  defined  on  a  subset  ofW,  and  let 
the  vector-valued  function  G  =  (gi ,  gi, . . . ,  gn)  be  defined  on  a  domain  Dq  in  W .  Let 
the  set 

r  =  {u|UeDG    and    G(U)  eD/j 
{Figure  5.2.3),  be  nonempty,  and  define  the  real-valued  composite  function 

h  =  f  o  G 

on  T  by 

h(V)  =  /(G(U)),    U  e  T. 

Now  suppose  that  Uo  is  in  T  and  is  a  limit  point  of  T,  G  is  continuous  at  Uo,  and  f  is 
continuous  at  Xo  =  G(Uo).  Then  h  is  continuous  at  Uq. 
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R(G)  =  range  of  G 


Figure  5.2.3 

Proof  Suppose  that  e  >  0.  Since  /  is  continuous  at  Xrj  =  G(Uo),  there  is  an  e\  >  0 
such  that 

|/(X)-/(G(U„))|  <€  (5.2.17) 

if 

|X-G(U0)|<6i    and    XeD/.  (5.2.18) 
Since  G  is  continuous  at  Un,  there  is  a  S  >  0  such  that 

|G(U)  -  G(U0)|  <  ei    if    |U-U0|<<5    and    U  e  DG. 

By  taking  X  =  G(U)  in  (5.2.17)  and  (5.2.18),  we  see  that 

|A(U)  -  A(U0)|  =  |/(G(U)  -  /(G(U0))|  <  € 

if 


|U-U0|<<5    and    U  e  T. 


H 


Example  5.2.12  If 

and 


m  =  ^ 


g(x,y)  =  l-x2-2y2 


then  Df  =  [0,  oo],  £)g  =  M2,  and 

r  =  {(*,  j)|x2  +  2j2  <  i}. 

From  Theorem  5.2.7  and  Example  5.2.1,  g  is  continuous  on  K2.  (We  can  obtain  the  same 
conclusion  by  observing  that  the  functions  p\ (x,  y)  =  x  and  p-i(x,  y)  =  y  are  continuous 
on  R2  and  applying  Theorem  5.2.8.)  Since  /  is  continuous  on  D /,  the  function 


Kx,  y)  =  f  (g(x,  y))  =  ^l-x^-2y2 


is  continuous  on  T. 
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Example  5.2.13  If   

g(x,  y)  =  jl-x2-2y2 


and 

'  sin  s 


then  D  f  =  (— oo,  oo)  and 


s 

1. 


,     S  ^0, 


Dg  =  T  =  {(x,y)\x2  +  2y2  <l}. 

In  Example  5.2.12  we  saw  that  g  (we  called  it  h  there)  is  continuous  on  T .  Since  /  is 
continuous  on  D  f,  the  composite  function  h  =  f  o  g  defined  by 

J\ -x2-2y2 

v  x2  +  2y2<l. 


Hx,y)=\  y/i_x2_2y2 

x2  +  2y2  =  1, 

is  continuous  on  T.  This  implies  the  result  of  Example  5.2.2. 


Bounded  Functions 

The  definitions  of  bounded  above,  bounded  below,  and  bounded  on  a  set  S  are  the  same  for 
functions  of  n  variables  as  for  functions  of  one  variable,  as  are  the  definitions  of  supremum 
and  infimum  of  a  function  on  a  set  S  (Section  2.2).  The  proofs  of  the  next  two  theorems  are 
similar  to  those  of  Theorems  2.2.8  and  2.2.9  (Exercises  5.2.12  and  5.2.13). 

Theorem  5.2.11  If  f  is  continuous  on  a  compact  set  S  in  R",  then  f  is  bounded 
on  S. 

Theorem  5.2.12  Let  f  be  continuous  on  a  compact  set  S  inW  and 

«=inf/(X),  /3=sup/(X). 
XeS  XeS 

Then 

f(Xi)  =  a    and    f(X2)  =  0 

for  some  Xi  andX2  in  S. 

The  next  theorem  is  analogous  to  Theorem  2.2.10. 

Theorem  5.2.13  (Intermediate  Value  Theorem)  Let  f  be  continuous  on 
a  region  S  in  W .  Suppose  that  A  and  B  are  in  S  and 

/(A)  <  u  <  /(B). 

Then  /(C)  =  u  for  some  C  in  S. 
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Proof   If  there  is  no  such  C,  then  S  =  R  U  T,  where 

R  =  {X|X  €  S  and  /(X)  <  u) 

and 

T  =  {X|X  €  S  and  /(X)  >  u} . 

IfXo  £  R,  the  continuity  of  /  implies  that  there  is  a  8  >  0  such  that  /(X)  <  w  if  |X— Xo|  < 
S  and  X  e  5.  This  means  that  X0  £  T.  Therefore,  R  n  7  =  0.  Similarly,  ~R  n  T  =  0. 
Therefore,  S  is  disconnected  (Definition  5.1.19),  which  contradicts  the  assumption  that  S 
is  a  region  (Exercise  5.1.30).  Hence,  we  conclude  that  / (C)  =  u  for  some  C  in  S.  TJ 

Uniform  Continuity 

The  definition  of  uniform  continuity  for  functions  of  n  variables  is  the  same  as  for  functions 
of  one  variable;  /  is  uniformly  continuous  on  a  subset  S  of  its  domain  in  K"  if  for  every 
€  >  0  there  is  a  8  >  0  such  that 

|/(X)-/(X')|<e 

whenever  |X  —  X'|  <  S  and  X.  X'  e  S.  We  emphasize  again  that  8  must  depend  only  on  e 
and  5,  and  not  on  the  particular  points  X  and  X'. 

The  proof  of  the  next  theorem  is  analogous  to  that  of  Theorem  2.2.12.  We  leave  it  to  you 
(Exercise  5.2.14). 

Theorem  5.2.14  If  f  is  continuous  on  a  compact  set  S  in  M",  then  f  is  uniformly 
continuous  on  S. 


5.2  Exercises 


With  K  replaced  by  W ,  the  following  exercises  from  Sections  2 . 1  and  2 .2  have  analogs  for 
this  section:  2.1.5,  2.1.8-2.1.11,  2.1.26,  2.1.28,  2.1.29,  2.1.33,  2.2.8,  2.2.9,  2.2.10,  2.2.15, 
2.2.16,  2.2.20,  2.2.29,  2.2.30. 

1.    Find  limx^xo  / (X)  and  justify  your  answer  with  an  €-8  argument,  as  required  by 
Definition  5.2.1.  Hint:  See  Examples  5.2.1  and5.2.2. 

(a)  f(X)  =  3x  +  4y  +  z-2,    X0  =  (1,2,1) 

(b)  /(X)  =  flzZl,    Xo  =  (l,l) 

x-y 

,  .  sin(x  +  Ay  +  2z) 

c     f(X)=  -,    X0  =  (-2,1,-1) 

x  +  4y  +  2z 
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(d)    /(X)  =  (x2  +  y2)  log(x2  +  y2)1'2,    X0  =  (0.  0) 

.  .  sin(x  —  y) 

e     f(X  =_^^,    X0  =  (2,2) 

(f)  /(x)  =  ±e-i/w    Xo  =  o 

2.  Prove  Theorem  5.2.2. 

3.  If  \imx^XQ  y(x)  =  y0  and  limx^xo  /  (x,y(x))  =  L,  we  say  that  f(x,y)  ap- 
proaches L  as  (x,  y)  approaches  (xo,  yo)  along  the  curve  y  =  y(x). 

(a)  Prove:  If  lim(JCi>,)^(JCOjj,0)  f(x,  y)  =  L,  then  f(x,  y)  approaches  L  as  (x,  y) 
approaches  (xq,  yo)  along  any  curve  y  =  y(x)  through  (xq,  yo). 

(b)  We  saw  in  Example  5.2.3  that  if 

f(x>y)  =  2^  2- 

then  lim(XJ,)^(o;o)  f(x,y)  does  not  exist.  Show,  however,  that  f(x,y)  ap- 
proaches a  value  La  as  (x,  y)  approaches  (0.  0)  along  any  curve  y  =  y(x) 
that  passes  through  (0,  0)  with  slope  a.  Find  La. 

(c)  Show  that  the  function 

3  4 

x^y* 

g(x,  y)  = 


(x2  +  y6)3 

approaches  0  as  (x,  y)  approaches  (0,  0)  along  a  curve  as  described  in  (b), 
but  that  lim(x^)^(o,o)  f(x,y)  does  not  exist. 

4.     Determine  whether  limx->x0  /(X)  =  ±oo. 

|  sin(x  +2y  +  4z)\ 
(x+2y  +  4z)2 


(a)    /(x)=         ,  ...  :  „,x2 X0  =  (2,-1,0) 


(b)  /(X)  =  — =,    X0  =  (0,0) 

,  .     „  sinl/x 

(c)  /(X)=_J=,  X0=(0,0) 

(d)  /(x)  =  (i^'  Xo  =  (2a) 

(e)  /(X)=7(^t2^y.    X„  =  (2,-1,0) 

(x  +  2y  +  Az,Y 

Find  lim.|x|— >oo  / (X),  if  it  exists. 

(a)  /(X,  .  '°^W>        (b)  /(X)  _         +  £ 
xz  +  yz  +  zz  y/x2  +  y2 


(c)  /(X)  =  e~^2  (d)  /(X)  =  e~*2-y2 
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(  sin(x2  -y2) 
(e)/(X)=       x2-y2    •  x*±y> 
[l,  x  =  ±y 

6.  Define  (a)  lirri|x|->oo  /(X)  =  oo  and  (b)  lizn|x|-»-oo  /(X)  =  — °o. 

7.  Let 

/(X)-  • 

For  what  nonnegative  values  of  ct\,  a-i,  . . . ,  a„,  b  does  limx->o  / (X)  exist  in  the 
extended  reals? 

8.  Let 

1  +  xby4 

Show  that  lim^i^oo  g(x,  ax)  =  oo  for  any  real  number  a.  Does 

lim  g(X)  =  oo? 

|X|-»oo 

9.  For  each  /  in  Exercise  5.2.1,  find  the  largest  set  S  on  which  /  is  continuous  or  can 
be  defined  so  as  to  be  continuous. 

10.  Repeat  Exercise  5.2.9  for  the  functions  in  Exercise  5.2.5. 

11.  Give  an  example  of  a  function  /  on  R2  such  that  /  is  not  continuous  at  (0,0), 
but  / (0,  y)  is  a  continuous  function  of  y  on  (— oo,  oo)  and  f(x,  0)  is  a  continuous 
function  of  x  on  (— oo,  oo). 

12.  Prove  Theorem  5.2.1 1.  Hint:  See  the  proof  of  Theorem  2.2.8. 

13.  Prove  Theorem  5.2.12.  Hint:  See  the  proof  of  Theorem  2.2.9. 

14.  Prove  Theorem  5.2.14.  Hint:  See  the  proof  of  Theorem  2.2.12. 

1 5 .  Suppose  that  X  e  D  f  Cl"  and  X  is  a  limit  point  of  D  f .  Show  that  /  is  continuous 
at  X  if  and  only  if  lim^oo  / (X^)  =  / (X)  whenever  {X^}  is  a  sequence  of  points 
in  D  f  such  that  lim^oo  X^  =  X.  Hint:  See  the  proof  of  Theorem  4.2.6. 

5.3  PARTIAL  DERIVATIVES  AND  THE  DIFFERENTIAL 


To  say  that  a  function  of  one  variable  has  a  derivative  at  xo  is  the  same  as  to  say  that  it 
is  differentiable  at  xq.  The  situation  is  not  so  simple  for  a  function  /  of  more  than  one 
variable.  First,  there  is  no  specific  number  that  can  be  called  the  derivative  of  /  at  a  point 
Xo  in  W .  In  fact,  there  are  infinitely  many  numbers,  called  the  directional  derivatives  of 
f  at  Xo  (defined  below),  that  are  analogous  to  the  derivative  of  a  function  of  one  variable. 
Second,  we  will  see  that  the  existence  of  directional  derivatives  at  Xo  does  not  imply  that  / 
is  differentiable  at  Xo,  if  differentiability  at  Xo  is  to  imply  (as  it  does  for  functions  of  one 
variable)  that  / (X)  —  / (Xo  )  can  be  approximated  well  near  Xo  by  a  simple  linear  function, 
or  even  that  /  is  continuous  at  Xq. 
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We  will  now  define  directional  derivatives  and  partial  derivatives  of  functions  of  several 
variables.  However,  we  will  still  have  occasion  to  refer  to  derivatives  of  functions  of  one 
variable.  We  will  call  them  ordinary  derivatives  when  we  wish  to  distinguish  between  them 
and  the  partial  derivatives  that  we  are  about  to  define. 

Definition  5.3.1  Let  $  be  a  unit  vector  and  X  a  point  in  W .  The  directional  derivative 
of  f  atX  in  the  direction  of  $  is  defined  by 

3/(X)     ,.    /(X  +  r»)-/(X) 

—        =  hm  

3<I>       t^o  t 

if  the  limit  exists.  That  is,  df  (X)/3$  is  the  ordinary  derivative  of  the  function 

h(t)  =  f(X  +  t*) 

at  t  =  0,  if  h'(0)  exists. 

Example  5.3.1  Let  $  =  (<pi,4>2, 03)  and 

f(x,  y,  z)  =  3xyz.  +  2x2  +  z2. 

Then 

h(t)  =  f(x  +  t <pi ,  y  +  t (j>2,  Z  +  tfo), 

=  3(x  +  t<pi)(y  +  t<p2)(z  +  ffo)  +  2(x  +  f(/)i)2  +  (z.+  f03)2 

and 

h'{t)  =  30i  (y  +  t(p2)(z  +  tfa)  +  3<p2(x  +  t<Pi)(z  +  tfo) 

+  303(jt  +  t<pi)(y  +  tfc)  +  40i  (x  +  t<pi)  +  203(z  +  tfo). 

Therefore, 

=  h'(0)  =  (3yz,  +  4x)0i  +  3xz</>2  +  (3xy  +  2z,)03.  (5.3.1) 


The  directional  derivatives  that  we  are  most  interested  in  are  those  in  the  directions  of 
the  unit  vectors 

Ei  =  (1,0,...,  0),    E2  =  (0,1,0,...,  0),...,    E„  =  (0,...,0, 1). 

(All  components  of  E,  are  zero  except  for  the  i  th,  which  is  1.)  Since  X  and  X  +  ?E;  differ 
only  in  the  i'th  coordinate,  3/(X)/3E;  is  called  the  partial  derivative  of  f  with  respect  to 
Xi  atX.  It  is  also  denoted  by  3/(X)/3x,  or  fx.  (X);  thus, 

9/(x)      f   ~~      ,.     f(xi  +t,x2,...,x„)-  f(xi,x2,  ...,x„) 

  =  fxi  (X)  =  hm  , 

dx\  t^o  t 


318    Chapter  5  Real-  Valued  Functions  of  Several  Variables 


3/(X)  f(Xi,...,Xi-i,X, ■  +t,Xi  +  i,...,Xn)  -  f(X\,X%,  ...,Xn) 

-5        =  fx,  (X)  =  hm  

dXi  t-^o  t 

if  2  <  i  <  »,  and 

—  =  A„(X)  =  hm  , 

dx„  t~*o  t 

if  the  limits  exist. 

If  we  write  X  =  (x,  y),  then  we  denote  the  partial  derivatives  accordingly;  thus, 

Bf(x,y)       ,f      ,  f(x  +  h,y)-f(x,y) 

— 5  =  fx(x,  y)  =  hm  

ox  h->-o  n 

and 

9/(*.3Q      f(     ,      v     f(x,y  +  h)-f(x,y) 

— 5  =  fy(x>  y)  =  ,hm  ;  • 

dy  h~*o  h 

It  can  be  seen  from  these  definitions  that  to  compute  fXj  (X)  we  simply  differentiate  / 
with  respect  to  x,  according  to  the  rules  for  ordinary  differentiation,  while  treating  the  other 
variables  as  constants. 


Example  5.3.2  Let 

f(x,y,z)  =  3xyz  +2x2  +  z.2  (5.3.2) 
as  in  Example  5.3.1.  Taking  $  =  Ej  (that  is,  setting  0i  =  1  and  4>2  —  <p3  =  0)  in  (5.3.1), 
we  find  that 

df(X)  3/(X) 
ox  dEi 

which  is  the  result  obtained  by  regarding  y  and  z,  as  constants  in  (5.3.2)  and  taking  the 
ordinary  derivative  with  respect  to  x.  Similarly, 

3/(X)  3/(X) 


dy  3E2 

and 

3/(X)  _  3/(X) 
3z  3E3 


=  3xz 


=  3xy  +  2z. 


The  next  theorem  follows  from  the  rule  just  given  for  calculating  partial  derivatives. 

Theorem  5.3.2  //  A,  (X)  andgXi(X)  exist,  then 
3(/  +  g)(X) 


dXi 
3(/g)(X) 

3x; 


=  fx.(X)  +  gXi(X), 
=  fXi(X)g(X)  +  f(X)gXj(X), 
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Xj  \dxi  J      dxjdxt         '  1 

33/ 


and,  ifg(X)  ^  0, 

3(//g)(X)  _  g(X)fXi(X)-f(X)gXi(X) 
d*i  [g(X)]2 

If  fx.  (X)  exists  at  every  point  of  a  set  D,  then  it  defines  a  function  /Xj.  on  D.  If  this 
function  has  a  partial  derivative  with  respect  to  xj  on  a  subset  of  D,  we  denote  the  partial 
derivative  by 

_      =  J2 

dx 

Similarly, 

3   /  d2f  \         33/  _ 
3xfc  v dxj dxi )      dxkdxjdxi  xix.ixk- 

The  function  obtained  by  differentiating  /  successively  with  respect  to  ,  x;-2, . . . ,  xir  is 
denoted  by 

3xiV3xiV_,.--3xil  --/^-"r-i^. 
it  is  an  rth-order  partial  derivative  of  f . 

Example  5.3.3  The  function 

f(x,y)  =  3x2y3+xy 

has  partial  derivatives  everywhere.  Its  first-order  partial  derivatives  are 

fx(x,  y)  =  6xy3  +  y,    fy(x,  y)  =  9x 2y2  +  x. 

Its  second-order  partial  derivatives  are 

fxx(x,  y)  =  6j3,  fyy(x,  y)  =  18x2y, 

fxy(x,y)=  18xy2  +  l,    fyx(x,y)  =  \%xy2  +  1. 


There  are  eight  third-order  partial  derivatives.  Some  examples  are 

fxxy(x,y)  =  18j2,    fXyx(x,y)  =  18j2,    fyxx(x,y)  =  18j2. 

Example  5.3.4  Compute  fxx(0, 0),  fyy(0, 0),  fxy(0, 0),  and  fyx(0, 0)  if 


{(x2y  +  xy2)  sin(x  -  y) 
 ^2  .  «>.0). 
0,  (x,y)  =  (0,0). 


Solution  If  (x,  y)  ^  (0,  0),  the  ordinary  rules  for  differentiation,  applied  separately  to 
x  and  y,  yield 

(2xy  +  y2)  sin(x  —  y)  +  (x2y  +  xy2)  cos(x  —  y) 

2x(x2y  +  xy2)  sin(x  -  y) 

— ,    (x,  y)  ^  (0, 0), 


(x2  +  j2)2 
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(x2  +  2xy)  sin(x  -  y)  -  (x2y  +  xy2)cos(x  -  y) 

fy(*>y)=  

2y(x2y  +  xy2)sm(x-y)  ,  ,  ,n  m  (5'3'4> 

(jc2  +  y2)2 

These  formulas  do  not  apply  if  (x,  j>)  =  (0,  0),  so  we  find  fx(0,  0)  and  fy(0,  0)  from  their 
definitions  as  difference  quotients: 

/JC(o,o)  =  iim/^0)-/(0'0)  =  iim^  =  o, 

x->0  X  x^>0  x 

/v(0.  0)  =  hm   =  hm  =  0. 

y  y~*0  y 

Setting  j  =  0  in  (5.3.3)  and  (5.3.4)  yields 

fx(x,  0)  =  0,    fy(x,  0)  =  sinx,    x  ^  0, 

so 

,   ,nm      ,.     A(*,0)- A(0,0)      ..  0-0 
Ax(0,  0)  =  hm   =  hm  =  0, 

x^0  X  x-*-0  X 

,   ,nm     r                                 r     sinx-0  , 
Jyx(0,0)  =  hm   =  hm   =  1. 

x^-0  X  x^0  X 

Setting  x  =  0  in  (5.3.3)  and  (5.3.4)  yields 

fx(0,y)=-smy,    fy(0,y)=0,    y  ±  0. 

so 

,   m  m      ,.     A(0,y)-A(0,0)      ,.  -siny-0 

/jcy(0,  0)  =  hm   =  hm   =  —1, 

y-±0  y  y^-0  y 

/w(0.  0)  =  hm   =  hm  =  0.  ■ 

yy  y^O  y  y^O  y 


This  example  shows  that  fXy(Ko)  and  fyx(Xo)  may  differ.  However,  the  next  theorem 
shows  that  they  are  equal  if  /  satisfies  a  fairly  mild  condition. 


Theorem  5.3.3  Suppose  that  f,  fx,  fy,  and  fxy  exist  on  a  neighborhood  N  of(xo,  yo), 
and  fxy  is  continuous  at  {xq,  yo).  Then  fyx(xo,  yo)  exists,  and 

fyx(X0,  yo)  =  fxy(X0,  }>o).  (5.3.5) 


Proof    Suppose  that  e  >  0.  Choose  S  >  0  so  that  the  open  square 
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Sg  =  {(x,  y)  |  \x  -xo\  <  S,  \y  -  jo|  <  $} 


is  in  N  and 


\fxy(x,y)- fxy(Xo,yo)\  <  €         if    (x,J)  €  SS.  (5.3.6) 

This  is  possible  because  of  the  continuity  of  fxy  at  (xq,  yo)-  The  function 

A(h,  k)  =  f(x0  +  h,y0  +  k)-  f(x0  +  h,  y0)  -  f(x0,  y0  +  k)  +  f(x0,  yo)  (5.3.7) 
is  defined  if  —  S  <  h,  k  <  8;  moreover, 

A(h,k)  =  4>(x0  +  h)-(p(xo),  (5.3.8) 

where 


<p(x)  =  f(x,  y0  +  k)-  f(x,  yo). 


Since 


cf>'(x)  =  fx(x,  yo  +  k)-  fx(x,  y0),    \x-xQ\  <  S, 
(5.3.8)  and  the  mean  value  theorem  imply  that 

A(h,  k)  =  [fx(x,  yo  +  k)-  fx(x,  y0)]  h, 


(5.3.9) 


where  x  is  between  Xq  and  xo  +  h.  The  mean  value  theorem,  applied  to  fx(x,  y)  (where  x 
is  regarded  as  constant),  also  implies  that 

fx(x,  yo  +  k)-  fx(x,  y0)  =  fxy(x,y)k, 

where  'y  is  between  yo  and  yo  +  k.  From  this  and  (5.3.9), 

A(h,k)  =  fxy{x,y)hk. 

Now  (5.3.6)  implies  that 

A(h,k) 


hk 


~  fxy(XQ,yo) 


\fxy(x,y)- fxy(x0,yo)\  <  e    if  0  <  \h\,  \k\  <  S. 

(5.3.10) 


Since  (5.3.7)  implies  that 

A(h,k) 
lim  — — — 

k^o  hk 


lim 


f(x0  +  h,y0  +  k)-  f(x0  +  h,  y0) 
hk 

f(x0,yo  +  k)-  f(x0,y0) 


-  lim 

k^o  hk 

fy(x0  +  h,  y0)  -  fy(x0,  y0) 


it  follows  from  (5.3.10)  that 

fy(xo  +  h,y0)-  fy(x0,yo) 
h 


fxy(Xo,yo) 


<€    if  0<\h\<8. 
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Taking  the  limit  as  h  — »■  0  yields 

\fyx(XQ,  yo)  ~  fxy(XQ,  Jo) I  <  €. 

Since  e  is  an  arbitrary  positive  number,  this  proves  (5.3.5).  TJ 

Theorem  5.3.3  implies  the  following  theorem.  We  leave  the  proof  to  you  (Exercises  5.3.10 
and  5.3.11). 

Theorem  5.3.4  Suppose  that  f  and  all  its  partial  derivatives  of  order  <  r  are  contin- 
uous on  an  open  subset  S  of  MP.  Then 

fxh  *,2„. (X)  =  fXh  xh,...,xJr  (X),     X  e  S.  (5.3.1 1) 

if  each  of  the  variables  X\,  Xj,  ... ,  xn  appears  the  same  number  of  times  in 

{xi  j ,     , . . . ,  Xir }    and    {x j ^ ,  xj2 , . . . ,  x jr } . 
If  this  number  is  r^.  we  denote  the  common  value  of  the  two  sides  of  (53.1 1)  by 

drf(X) 


dx['  dxj2  ■  ■  ■  dxrn" 

it  being  understood  that 


(5.3.12) 


0<rk<r,     l<k<n,  (5.3.13) 
n  +  r2  +  ---  +  r„  =  r,  (5.3.14) 

and,  ifrjc  =  0,  we  omit  the  symbol  dx^from  the  "denominator"  of  (53.12). 

For  example,  if  /  satisfies  the  hypotheses  of  Theorem  5.3.4  with  k  =  4  at  a  point  Xo  in 
W  (n  >  2),  then 

fxxyyO^-o)  =  fxyxy(X-o)  =  fxyyxOi-o)  =  fyyxxQ^-o)  =  fyxyxO^-o)  =  fyxxy(X-o), 

and  their  common  value  is  denoted  by 

94/(Xq) 
dx2dy2 

It  can  be  shown  (Exercise  5.3.12)  that  if  /  is  afunctionof  (x\,  X2,  •  •  • ,  xn)  and  (r\,  r%,..., 
is  a  fixed  ordered  «-tuple  that  satisfies  (5.3.13)  and  (5.3.14),  then  the  number  of  partial 
derivatives  fXilxi2-xir  that  involve  differentiation  r,  times  with  respect  to  Xi ,  1  <  i  <  n, 
equals  the  multinomial  coefficient 


ri!r2!  •■■/•„!' 
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Differentiable  Functions  of  Several  Variables 

A  function  of  several  variables  may  have  first-order  partial  derivatives  at  a  point  Xo  but  fail 
to  be  continuous  at  Xo .  For  example,  if 

1      A  V         (v.  vItMO.O). 


then 


and 


f(x,y)  =  lx2  +  y2     v""  '  (5.3.15) 
(0,  (x,y)  =  (0,0), 


,,nm     r    f(h,0)  -/(Q,Q)  0-0 

fx(0,  0)  =  hm   =  hm  — - —  =  0 

h^o  n  h^o  n 


,,ftn,     r    /(0,fc)-/(0,0)     ..  o-o 
/v(0.  0)  =  hm   =  hm  — - —  =  0, 

y  k^o  k  k~*o  k 


but  /  is  not  continous  at  (0,  0).  (See  Examples  5.2.3  and  5.2.1 1.)  Therefore,  if  differentia- 
bility of  a  function  of  several  variables  is  to  be  a  stronger  property  than  continuity,  as  it  is 
for  functions  of  one  variable,  the  definition  of  differentiability  must  require  more  than  the 
existence  of  first  partial  derivatives.  Exercise  2.3.1  characterizes  differentiability  of  a  func- 
tion /  of  one  variable  in  a  way  that  suggests  the  proper  generalization:  /  is  differentiable 
at  xq  if  and  only  if 

,.     fix)  -  f(x0)  -  m(x  -  x0) 
hm    =  0 

x^x0  X  —  Xq 

for  some  constant  m,  in  which  case  m  =  f'(xo)- 

The  generalization  to  functions  of  n  variables  is  as  follows. 

Definition  5.3.5  A  function  /  is  differentiable  at 

Xo  =  (*10,  *20,  •  •  • ,  xno)) 
if  Xq  e  D°  and  there  are  constants  m\,  m-i,  ....  mn  such  that 


/(X)-/(Xo) -*,<)) 


x1^  nHS  =  a  (53'16) 


Example  5.3.5  Let 

f(x,y)  =  x2  +  2xy. 
We  will  show  that  /  is  differentiable  at  any  point  (xq,  yo),  as  follows: 
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f(x,  y)  -  f(x0,  yo)  =  x2  +  2xy  -  x1  -  2x0y0 
=  x2  -  x2  +  2{xy  -  x0y0) 

=  (x  -  x0)(x  +  x0)  +  2{xy  -  x0y)  +  2(x0y  -  x0y0) 
=  (x  +  x0  +  2y)(x  -  x0)  +  2x0(y  -  yo) 
=  2(x0  +  yo)(x  -  x0)  +  2x0(y  -  yo) 

+  (x-  x0)(x  -  x0  +  2y  -  2y0) 
=  mx{x  -xo)  +  m2(y  -  yo)  +  (x  -  x0)(x  -  x0  +  2y  -2y0), 

where 

mi  =  2(x0  +  yo)  =  fx(xo,  yo)    and    m2  =  2x0  =  fy(x0,  yo)-  (5.3.17) 
Therefore, 

\f(x,  y)  -  f{xp,  yo)  -mx{x-  x0)  -  m2(y  -  yo)\  _  \x  -  xQ\\(x  -  x0)  +  2{y  -  y0)\ 
|X-X0|  "  |X-Xo| 

<  V5|X-X0|, 

by  Schwarz's  inequality.  This  implies  that 

,.     f(x,y)- f(xo,yo)-mi(x-xo)-m2(y -yo)  _ 

lim   ■  ■   =  0, 

x^x0  |X-X0| 

so  /  is  differentiable  at  (xq,  yo)-  ■ 

From  (5.3.17),  m\  =  fx(xo,yo)  and  m2  =  fy{xo,yo)  in  Example  5.3.5.  The  next 
theorem  shows  that  this  is  not  a  coincidence. 

Theorem  5.3.6  //  /  is  differentiable  at  Xo  =  (x\o,  x2o,  ■  ■  ■ ,  x„o),  then  fXl(Xo). 
_/jc2(Xo),  . . . ,  fXn(Xo)  exist  and  the  constants  m\,  m2,  ... ,  mn  in  (5.3.16)  are  given  by 

mi  =  fXi(Xf)),     l<i<n-  (5.3.18) 

that  is, 

n 

f(X)  -  /(Xo)  -      M  (Xo)(xi  -  xi0) 

i  =  l 

x^-Xo  X  —  Xq 


lim   — — — ■   =  0. 


Proof  Let  i  be  a  given  integer  in  {1, 2, . . . ,«}.  Let  X  =  Xo  +  fE,-,  so  that  x,  =  xto+t, 
Xj  =  Xjo  if  j  ^  z,  and  |X  —  Xo|  =  \t\.  Then  (5.3.16)  and  the  differentiability  of  /  at  Xo 
imply  that 

f(Xo  +  tEi)-f(X0)-mit 
lim   =  0. 

t~+o  t 
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Hence, 

,.    /(X0  +  rE,-)-/(X0) 

hm   =  m, . 

t^o  t 

This  proves  (5.3.18),  since  the  limit  on  the  left  is  fXj  (Xo),  by  definition.  H 
A  linear  function  is  a  function  of  the  form 

L(X)  =  miXi  +  m2x2  H  1-  mn*n,  (5.3.19) 

where  mi,  «12,  ■  ■  ■ .  mn  are  constants.  From  Definition  5.3.5,  /  is  differentiable  at  Xo  if 
and  only  if  there  is  a  linear  function  L  such  that  /(X)  —  /(Xo)  can  be  approximated  so 
well  near  Xo  by 

L(X)-L(X0)  =  L(X-X0) 

that 

/(X)  -  /(Xo)  =  L(X  -  Xo)  +  E(X)(\X  -  Xo|),  (5.3.20) 

where 

lim  E(X)  =  0.  (5.3.21) 

x-s-Xo 

Theorem  5.3.7  Iff  is  differentiable  atXo,  then  f  is  continuous  at Xo. 
Proof   From  (5.3.19)  and  Schwarz's  inequality, 

\L(X-X0)\  <M|X-X0|, 

where 

M  =  (m\  +  m\-\  h  m^)1/2. 

This  and  (5.3.20)  imply  that 

|/(X)-/(X0)|  <(M +  |£(X)|)|X-X0|, 

which,  with  (5.3.21),  implies  that  /  is  continuous  at  Xo.  H 

Theorem  5.3.7  implies  that  the  function  /  defined  by  (5.3.15)  is  not  differentiable  at 
(0,0),  since  it  is  not  continuous  at  (0,0).  However,  /^(O,  0)  and  /y(0,  0)  exist,  so  the 
converse  of  Theorem  5.3.7  is  false;  that  is,  a  function  may  have  partial  derivatives  at  a 
point  without  being  differentiable  at  the  point. 

The  Differential 

Theorem  5.3.7  implies  that  if  /  is  differentiable  at  Xo,  then  there  is  exactly  one  linear 
function  L  that  satisfies  (5.3.20)  and  (5.3.21): 

L(X)  =  fXl(Xo)xi  +  fX2(X0)x2  +  ■■■  +  fXn(X0)x„. 
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This  function  is  called  the  differential  of  f  atXo.  We  will  denote  it  by  dx0  f  and  its 
value  by  (<ix0/)(X);  thus, 

(dXQf)(X)  =  fXl  (Xo)jc,  +  fX2(X0)x2  +  ■■■  +  fXn(X0)xn.  (5.3.22) 

In  terms  of  the  differential,  (5.3.16)  can  be  rewritten  as 

]im  /(X)-/(X0)-(rfxo/)(X-Xo)  = 
x™0  |X-Xo| 

For  convenience  in  writing  dx0f,  and  to  conform  with  standard  notation,  we  introduce 
the  function  dx, ,  defined  by 

dxj(X)  =  X[  \ 

that  is,  dxi  is  the  function  whose  value  at  a  point  in  R"  is  the  2  th  coordinate  of  the  point.  It 
is  the  differential  of  the  function  gi  (X)  =  Xi .  From  (5.3.22), 

dx0f  =  fx,  (X0)  rfjci  +  fX2(X0  dx2  +  ---  +  fXn  (X0)  rfx„.  (5.3.23) 
If  we  write  X  =  (x,  J, ...,),  then  we  write 

^Xo/  =  A(X0)       +  /y  (X0)  rfj  +  •  •  •  , 

where  dx,  dy,  ...  are  the  functions  defined  by 

dx(X)  =  x,    dy(X)  =  y,... 

When  it  is  not  necessary  to  emphasize  the  specific  point  Xo,  (5.3.23)  can  be  written  more 
simply  as 

df  =  fxi  dx\  +  fX2  dx2  H  h  fXn  dx„. 

When  dealing  with  a  specific  function  at  an  arbitrary  point  of  its  domain,  we  may  use  the 
hybrid  notation 

df  =  fXl  (X)  dXi  +  fX2  (X)  dx2  +  ---  +  fXn  (X)  dxn . 

Example  5.3.6  We  saw  in  Example  5.3.5  that  the  function 

f(x,y)  =  x2+2xy 
is  differentiable  at  every  X  in  R" ,  with  differential 

df  =  (2x  +  2y)  dx  +  2x  dy. 
To  find  d\0f  with  Xo  =  (1,  2),  we  set  xq  =  1  and  yo  =  2;  thus, 

dx0  f  =  6  dx  +  2  dy 

and 

(dXof  )(X  -  Xo)  =  6(x  -  1)  +  2(y  -  2). 
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Since  f{\,  2)  =  5,  the  differentiability  of  /  at  (1,  2)  implies  that 

f(x,y)-5-6(x-l)-2(y-2) 
hm   ■   =  0. 

(*,y)-Kl,2)  ^(x  -  1)2  +  (y-  2)2 

Example  5.3.7  The  differential  of  a  function  f  =  f  (x)  of  one  variable  is  given  by 

dxQf  =  f'(x0)dx, 
where  dx  is  the  identity  function;  that  is, 

dx(t)  =  t. 

For  example,  if 

f(x)  =  3x2  +  5x3, 

then 

df  =  (6x  +  I5x2)dx. 

If  xo  =  —  1 ,  then 

dxof  =  9dx,    (dXQf)(x  -  x0)  =  9(x  +  1), 
and,  since  /(—  1)  =  —2, 

,.      f(x)  +  2-9(x+l) 

hm    =0.  ■ 

X-+-1  x  +  1 

Unfortunately,  the  notation  for  the  differential  is  so  complicated  that  it  obscures  the 
simplicity  of  the  concept.  The  peculiar  symbols  df,  dx,  dy,  etc.,  were  introduced  in 
the  early  stages  of  the  development  of  calculus  to  represent  very  small  ("infinitesimal") 
increments  in  the  variables.  However,  in  modern  usage  they  are  not  quantities  at  all,  but 
linear  functions.  This  meaning  of  the  symbol  dx  differs  from  its  meaning  in  f£  f  (x)  dx, 
where  it  serves  merely  to  identify  the  variable  of  integration;  indeed,  some  authors  omit  it 
in  the  latter  context  and  write  simply  f%  f. 

Theorem  5.3.7  implies  the  following  lemma,  which  is  analogous  to  Lemma  2.3.2.  We 
leave  the  proof  to  you  (Exercise  5.3.13). 

Lemma  5.3.8  If  f  is  dijferentiable  atXo,  then 

/(X)  -  /(Xo)  =  (dXof  )(X  -  Xo)  +  E(X)\X  -  X0|, 
where  E  is  defined  in  a  neighborhood  of  X$  and 

lim  E(X)  =  E(X0)  =  0. 

X-*-Xo 

Theorems  5.3.2  and  5.3.7  and  the  definition  of  the  differential  imply  the  following 
theorem. 
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Theorem  5.3.9  Iff  and  g  are  dijferentiable  atXo,  then  so  are  f  +  g  and  fg.  The 
same  is  true  of  f  / g  ifg(Xo)  ^  0.  The  differentials  are  given  by 

dx0(f  +  g)  =  dx0f  +  dx0g, 

dx0(fg)  =  f(X0)dXog  +  g(X0)dXof, 

and 

g(X0)dXof  -  f(X0)dXog 


'x0 


[?(Xo)] 


2 


The  next  theorem  provides  a  widely  applicable  sufficient  condition  for  differentiability. 

Theorem  5.3.10  If  fxi ,  fX2, ... ,  fXn  exist  on  a  neighborhood  of  Xq  and  are  contin- 
uous at  Xq,  then  f  is  differentiable  atX^. 

Proof  Let  Xo  =  {x\q,x2q,  . . . ,  x„q)  and  suppose  that  e  >  0.  Our  assumptions  imply 
that  there  is  a  8  >  0  such  that  fX{ ,  fX2, . . . ,  fXn  are  defined  in  the  «-ball 

S8(X0)  =  {X|  |X-X0|  <S} 

and 

\fXj  (X)  -  fXj  (X0)|  <  €    if    |X  -  X0|  <  S,     l<j<n.  (5.3.24) 
Let  X  =  (xi,x, . . . ,  x„)  be  in  5^(Xo).  Define 

Xj  =  (xi, . .  .,Xj,Xj+i,o,  ■  ■  .,xn0),     1  <  j  <  n  -  1, 

and  Xn  =  X.  Thus,  for  1  <  j  <  n,  Xj  differs  from  X7_i  in  the  j th  component  only,  and 
the  line  segment  from  X7_i  to  Xy  is  in  5^ (Xo).  Now  write 

n 

f(X)  -  /(Xo)  =  f(Xn)  -  /(Xo)  =       If&j)  ~  m,-i)l  (5-3.25) 

7  =  1 

and  consider  the  auxiliary  functions 

gl(t)  =  f(t,X20,---,Xn0), 

gj(t)  =  f(xi,...,Xj-i,t,Xj+ii0,...,xno),    2<j<n-\,  (5.3.26) 

gn(t)  =  f(xi,...,Xn-\,t), 

where,  in  each  case,  all  variables  except  t  are  temporarily  regarded  as  constants.  Since 

f(Kj)-f(Xj-l)  =  gj(xj)-gj(xJo), 
the  mean  value  theorem  implies  that 

f(Xj)-f(Xj-l)  =  g'j(Tj)(Xj-Xj0), 
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where  Xj  is  between  Xj  and  Xjo-  From  (5.3.26), 

gfJ(rJ)  =  fXj&j), 
where  X/  is  on  the  line  segment  from  X;— i  to  X/.  Therefore, 

f(Xj)  -  /(Xj-!)  =  fXj  (Xj)(Xj  -  Xjo), 

and  (5.3.25)  implies  that 

n 

f(X)  -  /(X0)  =       f*j  KjXxj  -  xJo) 


fxy  (XQ)(xj  -  xj0)  +  Ifxj  (%)  -  hi  (Xo)](^  -  xj0). 
y=i  ./=i 


From  this  and  (5.3.24), 


/(X)  -  f(X0)  -  hi  (Xo)(*y  -  XJ0) 

.7  =  1 


<  e      \xj  -xj0\  <  ne|X-X0| 

7  =  1 


which  implies  that  /  is  differentiable  at  Xo.  H 

We  say  that  /  is  continuously  differentiable  on  a  subset  5  of  R"  if  S  is  contained  in  an 
open  set  on  which  fXl ,  fX2,  . . . ,  fXn  are  continuous.  Theorem  5.3.10  implies  that  such  a 
function  is  differentiable  at  each  Xo  in  S. 

Example  5.3.8  If 

x2  +  y2 

f(x,y)  =  , 

x-y 

then 

ft      \        2x       x2  +  y2        A     f,      v        2y        x2  +  y2 

fx(x,y)  =  and    fy(x,y)  =  +  — . 

x-y     (x-y)2  x-y  (x-y)2 

Since  fx  and  fy  are  continuous  on 

S  =  {(x,y)\x  ^  y}, 
f  is  continuously  differentiable  on  S. 

Example  5.3.9  The  conditions  of  Theorem  5.3.10  are  not  necessary  for  differentiabil- 
ity; that  is,  a  function  may  be  differentiable  at  a  point  Xo  even  if  its  first  partial  derivatives 
are  not  continuous  at  Xo .  For  example,  let 

./(.v-v»-{(V-V,2Mn^7-    V  f  V- 

x  =  y. 


330    Chapter  5  Real-Valued  Functions  of  Several  Variables 


Then 

/x(x,  y)  =  2(x  —  j)  sin  cos  ,    x  ^  y, 

x  —  y         x  —  y 

and 

/(s  +  ft,  s) -/(*,*)  A2sin(l/A)-0 
/x(x,x)  =  hm  ;  =  hm   =  0, 

so  fx  exists  for  all  (x,  y),  but  is  not  continuous  on  the  line  y  =  x.  The  same  is  true  of  fy , 
since 

/^(x,  j)  =  — 2(x  —  y)  sin  h  cos  ,    x  ^  y, 


and 


x  —  y  x  —  y 


/(x,x  +  £)-/(x,x)  fc2sin(-l/fc)-0 
/v(x,  x)  =  hm   =  hm   =  0. 


Now, 

f(x,  y)  -  /(0, 0)  -  fx(0, 0)x  -  fy(0, 0)y 


(x-y)2    .  1 


sin  ■ 


4x^ 


r 


,     .  i/y, 

=  \  V*2  +  y2 

0,  x  =  y, 


and  Schwarz's  inequality  implies  that 


(x-y)2    .  l 

sin  ■ 


Vx2  +  y2  *-J> 


2(x2+Z2)  AT— - 
<  —j=        -  =  2y/x2  +  y2,  x^y. 
jx2  +  y2 


Therefore, 


,.       fix,  y)  -  /(o,  Q)  -  fx(P,  0)x  -  fy(P,  0)y 
hm   ,    =  0, 


(*,;y)->-(0,0)  ^jx2  +  y2 

so  /  is  differentiable  at  (0,  0),  but  fx  and  fy  are  not  continuous  at  (0,  0). 


Geometric  Interpretation  of  Differentiability 

In  Section  2.3  we  saw  that  if  a  function  /  of  one  variable  is  differentiable  at  xq,  then  the 
curve  y  =  f(x)  has  a  tangent  line 

y  =  T(x)  =  /(x0)  +  /'(x0)(x  -  x0) 

that  approximates  it  so  well  near  xo  that 

lim/W-rW=0. 

x->xq        X  —  Xo 

Moreover,  the  tangent  line  is  the  "limit"  of  the  secant  line  through  the  points  (xi,  /(xo)) 
and  (xq,  f(xo))  as  xi  approaches  xq. 
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z 
k 


y 


X 

Figure  5.3.1 

Differentiability  of  a  function  of  n  variables  has  an  analogous  geometric  interpretation. 
We  will  illustrate  it  for  n  =  2.  If  /  is  defined  in  a  region  D  in  R2,  then  the  set  of  points 
(x,y,z)  such  that 

z  =  f(x,y),     (x,y)eD,  (5.3.27) 
is  a  surface  in  M3  (Figure  5.3.1). 


z 


Figure  5.3.2 

If  /  is  differentiable  at  Xo  =  (xo,  JoX  tnen  tne  plane 

z  =  T(x,  y)  =  /(Xo)  +  /»(Xo)(jc  -  x0)  +  fy(X0)(y  -  y0) 


(5.3.28) 


intersects  the  surface  (5.3.27)  at  (xo,  yo,  f(xo,  yo))  and  approximates  the  surface  so  well 
near  (xq,  yo)  that 
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f(x,  y)  —  T(x,  y) 
lim  71/7  JJ       =  0 

(x,y)^(xo,yo)  J(x-  X0)2  +  (j  -  y0)2 

(Figure  5.3.2).  Moreover,  (5.3.28)  is  the  only  plane  in  M3  with  these  properties  (Exer- 
cise 5.3.25).  We  say  that  this  plane  is  tangent  to  the  surface  z  =  f(x,  y)  at  the  point 
(xq,  yo,  f  (xo,  yo)).  We  will  now  show  that  it  is  the  "limit"  of  "secant  planes"  associated 
with  the  surface  z,  =  f(x,y),  just  as  a  tangent  line  to  a  curve  y  =  f  (x)  in  R3  is  the  limit 
of  secant  lines  to  the  curve  (Section  2.3). 

Let  X,  =  (xi,yi)  (i  =  1,  2,  3).  The  equation  of  the  "secant  plane"  through  the  points 
(xi,  yi,  f(Xi,  yi))  (i  =  1,  2,  3)  on  the  surface  z.  =  f(x,  y)  (Figure  5.3.3)  is  of  the  form 

z  =  /(Xo)  +  A(x  -  x0)  +  B(y  -  y0),  (5.3.29) 

where  A  and  B  satisfy  the  system 

/(XO  =  /(Xo)  +  A(xx  -  x0)  +  B(yi  -  y0), 
/(X2)  =  /(Xo)  +  A(x2  -  x0)  +  B(y2  -  y0). 

Solving  for  A  and  B  yields 

A     (/(XO  -  f(X0))(y2  -  yo)  -  (/(XO  -  /(X0))(j!  -  yo) 


(xi  -  x0)(y2  -  yo)  -  (x2  -  x0)(yi  -  yo) 


(5.3.30) 


and 


if 


B  _  (/(X2)  -  f(Xo))(Xl  -  xq)  -  (/(XQ  -  f(X0))(x2  -  xp) 
(xi  -  x0)(y2  -  yo)  -  (x2  -  x0)(y1  -  y0) 


(xi  -  x0)(y2  -  yo)  -  (x2  -  xo)(y\  -  yo)  +  0,  (5.3.32) 


which  is  equivalent  to  the  requirement  that  Xo,  Xi,  and  X2  do  not  lie  on  a  line  (Exer- 
cise 5.3.23).  If  we  write 

Xi  =  X0  +  fU    and    X2  =  X0  +  fV, 

where  U  =  (u\,u2)  and  V  =  {v\,v2)  are  fixed  nonzero  vectors  (Figure  5.3.3),  then 
(5.3.30),  (5.3.31),  and  (5.3.32)  take  the  more  convenient  forms 


/(Xp  +  fU)  -  /(Xp)        /(Xp  +  fV)  -  /(Xp) 
 1  V2  1  Uz 

U\V2— U2V\ 

/(Xo  +  tV)  -  /(Xo)        /(Xo  +  fU)  -  /(X0) 

-U\  V\ 


(5.3.33) 


and 


B  =  (-  f-  ,  (5.3.34) 

UiV2  -  U2V\ 


u\v2  -  u2vi  ^  0. 
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If  /  is  differentiable  at  Xq,  then 


where 


S  ' —       /  / 
/                      /  / 

 >- 

x, 

Figure  5.3.3 

u 

-  x0)  +  fy(X0)(y  - 

-yo)  +  e(X)\X 

lim  e(X)  =  0. 


(5.3.36) 

Substituting  first  X  =  X0  +  rU  and  then  X  =  X0  +  rV  in  (5.3.35)  and  dividing  by  t  yields 
/(Xq  +  tV)  -  /(Xo) 


and 


where 


so 


/(Xo  +  fV)  -  /(Xo) 


/*(Xo)wi  +  /,(Xo)w2  +  £i(0|U| 


=  /*(X0>i  +  /,(Xo)i>2  +  £2(0|V|, 


(5.3.37) 


(5.3.38) 


£i(f)  =  e(X0  +  rU)|f|/r    and    E2(t)  =  e(X0  +  tY)\t\/t , 


Um  Ei(t)  =  0,    i  =  1,2, 


(5.3.39) 

because  of  (5.3.36).  Substituting  (5.3.37)  and  (5.3.38)  into  (5.3.33)  and  (5.3.34)  yields 

4  =  A(X0)  +  Aj(0,     5  =  /,(Xo)  +  A2(0,  (5.3.40) 

where 
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v2\V\E1(t)-u2\V\E2(t) 


and 


so 


Ai(0  = 
A2(0  = 


UiV2  -  U2V\ 

m|V|£2(0-«i|U|gi(0 

U\V2-U2Vi 


limA,(O  =  0,    z  =  1,2,  (5.3.41) 

because  of  (5.3.39). 

From  (5.3.29)  and  (5.3.40),  the  equation  of  the  secant  plane  is 

z.  =  /(X0)  +  [fx(X0)  +  Aj(0](x  -  jc0)  +  [fy(X0)  +  A2(t)](y  -  y0). 

Therefore,  because  of  (5.3.41),  the  secant  plane  "approaches"  the  tangent  plane  (5.3.28)  as 
t  approaches  zero. 

Maxima  and  Minima 

We  say  that  Xo  is  a  local  extreme  point  of  /  if  there  is  a  S  >  0  such  that 

/(X)  -  /(Xo) 

does  not  change  sign  in  S^(Xo)  D  D  f.  More  specifically,  Xo  is  a  local  maximum  point  if 

/(X)  <  /(Xo) 

or  a  local  minimum  point  if 

/(X)  >  /(X0) 

forallXin5s(X0)n  Df. 

The  next  theorem  is  analogous  to  Theorem  2.3.7. 

Theorem  5.3.11  Supposethat  f is  defined  in  a  neighborhood ofXo  inR"  and  /xl(Xo), 
/x2(Xo),  •  •  • ,  fxn  (^o)  exist.  Let  Xo  be  a  local  extreme  point  of  f.  Then 

fx.(X0)  =  0,     1  <i<n.  (5.3.42) 

Proof  Let 

Ei  =  (1,0,...,  0),    E2  =  (0,1,0,... ,0),...,    E„  =  (0,0,...,  1), 

and 

gt(t)  =  /(Xo  +  fE,),  l<i<n. 
Then  g*  is  differentiable  at  f  =  0,  with 

g;(o)  =  /x;(x0) 
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(Definition  5.3.1).  Since  Xo  is  a  local  extreme  point  of  f,  to  =  0  is  a  local  extreme  point 
of  gi .  Now  Theorem  2.3.7  implies  that  g',  (0)  =  0,  and  this  implies  (5.3.42).  H 

The  converse  of  Theorem  5.3.11  is  false,  since  (5.3.42)  may  hold  at  a  point  Xo  that  is 
not  a  local  extreme  point  of  /.  For  example,  let  Xo  =  (0,  0)  and 

f(x,y)  =  x3  +  y3. 

We  say  that  a  point  Xo  where  (5.3.42)  holds  is  a  critical  point  of  /.  Thus,  if  /  is  defined 
in  a  neighborhood  of  a  local  extreme  point  Xo,  then  Xo  is  a  critical  point  of  / ;  however,  a 
critical  point  need  not  be  a  local  extreme  point  of  /. 

The  use  of  Theorem  5.3.11  for  finding  local  extreme  points  is  covered  in  calculus,  so  we 
will  not  pursue  it  here. 

5.3  Exercises 


1.  Calculate  3/(X)/ 3$. 

(a)  f(x,  y)  =  x2  +  2xy  cos  x,    <I>  = 

(b)  /(w)  =  c-^,  *  =  (J_.__L_L) 

<c)  /<x)  =  ixp,  *  =  (4=.  4=.-, 4=) 

(d)    f{x,y,z)  =  log(l  +  JC  +  y+z),    $  =  (0,1,0) 

2.  Let 

(xy  sin x 
i-  (x,y)^(0,0), 
x2  +  y2 
0,  (x,j)  =  (0,0), 

and  let  $  =  {(p\,  (p2)  be  a  unit  vector.  Find  3/(0,  0)/3$. 

3.  Find  3 / (Xo)/3$,  where  $  is  the  unit  vector  in  the  direction  of  Xi  —  X) . 

(a)  f(x,y,z)  =  shurxyz;   X0  =  (l,l,-2),   Xi  =  (3,2,-1) 

(b)  f(x,  y,z)  =  e-^2+y2+2^;    X0  =  (1,0,-1),    Xi  =  (2,0,-1) 

(c)  f(x,y,z)  =  \og(l  +  x  +  y  +  z);    X0  =  (1,0,1),    X1  =  (3,0,-1) 

(d)  /(X)  =  |X|4;    X0  =  0,    Xi  =  (1,1,...,1) 

4.  Give  a  geometrical  interpretation  of  the  directional  derivative  df(xo,  yo)/d<!>  of  a 
function  of  two  variables. 

5.  Find  all  first-order  partial  derivatives. 

(a)  f(x,  y,  z)  =  log(x  +  y  +  2z)     (b)  f(x,  y,  z)  =  x2  +  3xyz,  +  2xy 

(c)  f(x,  y,  z.)  =  xeyz  (d)  f(x,  y,  z)  =  z.  +  sinx2j 

6.  Find  all  second-order  partial  derivatives  of  the  functions  in  Exercise  5.3.5. 
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7.  Find  all  second-order  partial  derivatives  of  the  following  functions  at  (0,  0). 

(xy(x2  —  y2 
0,  (x,y)  =  (0,0) 

(^tan-^-^tan"1-,    x  ±  0,     y  jt  0, 
(b)   /(*,)>)  =  x  y 

( 0,  x  =  0    or    y  =  0 

(Here  |  tan-1  u\  <  jt/2.) 

8.  Find  a  function  /  =  f(x,y)  such  that  fxy  exists  for  all  (x,y),  but  /j,  exists 
nowhere. 

9.  Let  u  and  u  be  functions  of  two  variables  with  continuous  second-order  partial 
derivatives  in  a  region  S .  Suppose  that  ux  =  vy  and  uy  =  —vx  in  S.  Show 
that 

Uxx  ~t~  Uyy  =  VXx  ~t~  Vyy  =  0 

in  5. 

10.  Let  /  be  a  function  of  (xi,  Xz,  ■  ■  ■ ,  xn)  (n  >  2)  such  that  fXi ,  fXj,  and  fXiXj  (i  ^ 
j )  exist  on  a  neighborhood  of  Xo  and  fXiXj  is  continuous  at  Xo.  Use  Theorem  5.3.3 
to  prove  that  fX/Xj  (X0)  exists  and  equals  fx.x.  (X0). 

11.  Use  Exercise  5.3.10  and  induction  on  r  to  prove  Theorem  5.3.4. 

12.  Let  r\ ,r%,...,  r„  be  nonnegative  integers  such  that 

r\  +  r2-\  1-  rn  =  r  >  0. 

(a)  Show  that 

(ZX+Z2  +  -  +  z„Y  =      ri,j',.rw,z?42  •  •  -4". 

where  denotes  summation  over  all  «-tuples  (r\,  r%, . . .  ,rn)  that  satisfy 
the  stated  conditions.  Hint:  This  is  obvious  if  n  =  1,  and  it  follows  from 
Exercise  1.2.19  if  n  =  2.  Use  induction  on  n. 

(b)  Show  that  there  are 

r! 

n\r2\---rn\ 

ordered  «-tuples  of  integers  (z'i,  i2, . . . ,  in)  that  contain  r\  ones,  r2  twos, 
and  rn  n's. 

(c)  Let  /  be  a  function  of  (x\ ,  x2, . . . ,  xn).  Show  that  there  are 

H 


! !  ■  ■  ■  r_ ! 


r\\r2\ 


partial  derivatives  fXi^Xi  -XiT  that  involve  differentiation  r,  times  with  respect 
to  Xi ,  for  i  =  1,2, ...  ,n. 


13.     Prove  Lemma  5.3.8. 
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14.     Show  that  the  function 


2 

x  y 


[0,  (x,y)  =  (0,0), 

has  a  directional  derivative  in  the  direction  of  an  arbitrary  unit  vector  O  at  (0,  0),  but 
/  is  not  continuous  at  (0,  0). 

15.  Prove:  If  fx  and  fy  are  bounded  in  a  neighborhood  of  (xo,  yo),  then  /  is  continuous 
at  (x0,  yo)- 

16.  Show  directly  from  Definition  5.3.5  that  /  is  differentiable  at  Xo. 

(a)  f(x,y)  =  2x2  +  3xy  +  y2,    X0  =  (l,2) 

(b)  f(x,y,z)  =  2x2  +  3x  +  4yz,    X0  =  (1, 1, 1) 

(c)  /(X)  =  |X|2,    X0  arbitrary 

17.  Suppose  that  fx  exists  on  a  neighborhood  of  (xo,  yo)  and  is  continuous  at  (xo,  yo), 
while  fy  merely  exists  at  (xo,  yo)-  Show  that  /  is  differentiable  at  (xo,  yo)- 

18.  Find  df  and  dxQf,  and  write  (dx0f  )(X  —  Xo). 

(a)  f(x,  y)  =  x3  +  4xy2  +  2xy  sinx,    X0  =  (0, -2) 

(b)  f(x,  y,  z)  =  e~(-x+y+z\    X0  =  (0, 0, 0) 

(c)  /  (X)  =  log(l  +  xx  +  2x2  +  3x3  +  •  ■  •  +  nxn),    X0  =  0 

(d)  f(X)  =  \X\2r,    X0  =  (l,l,l,...,l) 

19.  (a)    Suppose  that  /  is  differentiable  at  Xo  and  $  =  ((pi,  (p2,  ■  ■  ■ ,  <pn)  is  a  unit 

vector.  Show  that 

=  fx,  (Xo)0i  +  A2(Xo)</>2  H  h  fx„QLo)<pn- 

(b)    For  what  unit  vector  $  does  3/(Xo)/3<l>  attain  its  maximum  value? 

20.  Let  /  be  defined  on  R"  by 

/(X)  =  g(xi)  +  g(x2)  +  ■■■  +  g(x„), 

where 

g(u)  = 


■  1 

I  u  sin  — ,    u  0, 

i  u 

1  0.  u  =  0. 


Show  that  /  is  differentiable  at  (0,  0, ... ,  0),  but  fxi ,  fX2,  . . .,  fXn  are  all  discon- 
tinuous at  (0,  0, ... ,  0). 

21.  The  purpose  of  this  exercise  is  to  show  that  if  /,  fx  and  fy  exist  on  a  neighborhood 
N  of  (xo^o)  and  fx  and  fy  are  differentiable  at  (xo^o),  then  fxy  (xo,  yo)  = 
fyx(xo,  yo)-  Suppose  that  the  open  square 

{(x,y)\  \x-x0\  <  \h\,\y-y0\  <  \h\} 
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is  in  N .  Consider 

B(h)  =  f(x0  +  h,y0  +  h)  -  f(x0  +  h,  yo)  -  f(x0,  yo  +  h)  +  f(x0,  yo). 

(a)  Use  the  mean  value  theorem  as  we  did  in  the  proof  of  Theorem  5.3.3  to  write 

B(h)  =  [fx(x,y0  +  k)-Mx,y0)]h, 

where  x  is  between  xq  and  xq  +  h.  Then  use  the  differentiability  of  fx  at 
(xo,  yo)  to  infer  that 

EUh) 

BQi)  =  h2fxy(xo,  yo)  +  hEAh),     where      lim  — — -  =  0. 

h^o  h 

(b)  Use  the  mean  value  theorem  to  write 

B(h)  =  [fy(xo+h,y)-fy(xo,y)]h, 

where  ~y  is  between  yo  and  yo  +  h.  Then  use  the  differentiability  of  fy  at 
(xq,  yo)  to  infer  that 

B(h)  =  h2fyx(x0,  yo)  +  hE2(h),    where     lim  =  0. 

/i->o  h 

(c)  Infer  from  (a)  and  (b)  that  fxy(x0,  yo)  =  fyx(x0,  y0). 

22.  (a)   Let  fx.  and  fx.  be  differentiable  at  a  point  Xo  in  W.   Show  from  Exer- 

cise 5.3.21  that 

(b)  Use  (a)  and  induction  on  r  to  show  that  all  (r  —  l)-st  order  partial  derivatives 
of  /  are  differentiable  on  an  open  subset  S  of  Rn,  then  fXi  Xj2-Xjl.  (X)  (X  e  S) 
depends  only  on  the  number  of  differentiations  with  respect  to  each  variable, 
and  not  on  the  order  in  which  they  are  performed. 

23.  Prove  that  (xo,  yo),  (xi,  yi),  and  {xj,  y%)  lie  on  a  line  if  and  only  if 

(xi  -  x0)(j2  -  yo)  -  (x2  -  x0)(yi  -  yo)  =  o. 

24.  Find  the  equation  of  the  tangent  plane  to  the  surface 

z  =  f(x,  y)   at    (x0,  y0,  z0)  =  (xo,  yo,  f(x0,  yo))- 

(a)  f{x,y)=x2  +  y2-\,    (x0,  y0)  =  (1,2) 

(b)  f(x,y)  =  2x  +  3y  +  l,    (x0,  y0)  =  (1, -1) 

(c)  fix,  y)  =  xysmxy,    (x0,  yo)  =  (1, 

(d)  f(x,y)  =  x2-2y2  +  3xy,     (x0,  yo)  =  (2, -1) 
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25.     Prove:  If  /  is  differentiable  at  (xq,  yo)  and 

,.  f(x,y)-a-b(x-x0)-c(y-y0)  _ 
hm     =  0, 

(x,y)^(xQ,y0)  ^J(x  -x0)2  +  (y  -  y0)2 

then  a  =  f(x0,  y0),  b  =  fx(x0,  y0),  and  c  =  fy(x0.  yo)- 
5.4  THE  CHAIN  RULE  AND  TAYLOR'S  THEOREM 

We  now  consider  the  problem  of  differentiating  a  composite  function 

h(U)  =  /(G(U)), 

where  G  =  (gi,  g2,  ■  ■  ■ ,  gn)  is  a  vector-valued  function,  as  defined  in  Section  5.2.  We 
begin  with  the  following  definition. 

Definition  5.4.1  A  vector-valued  function  G  =  (g\,  g2, . . .  ,gn)  is  differentiable  at 

Uo  =  (Uio,  U20,  .  .  •  ,  Mfflo) 

if  its  component  functions  g\,  g2,  ■  ■  ■ ,  gn  are  differentiable  at  Uo-  ■ 

We  need  the  following  lemma  to  prove  the  main  result  of  the  section. 
Lemma  5.4.2  Suppose  that  G  =  (gi ,  g%, . . . ,  gn)  is  differentiable  at 

Uo  =  (u\q,  U20,  ■  ■  ■ ,  umo), 

and  define 

M 


\i'=i  j=i  J 


Then,  if  e  >  0,  there  is  a  8  >  0  such  that 
|G(U)-G(U0 


|U-Uo| 


<M  +  e    if  0<|U-U0|<<5. 


Proof  Since  g\,  g2,  ■  ■ .,  gn  are  differentiable  at  Uo,  applying  Lemma  5.3.8  to  gi  shows 
that 

gi  (U)  -  gi  (Uo)  =  (dVogi)(V  -  Uo)  +  Ei  (U)  |  (U  -  Uo  | 

^3g«(Uo),  ^va^m    it  i  (5A1) 

=  2^  "5  W  _  uJo)  +  £,(U)|(U  -U0|, 

7=1  dUj 
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where 


lim  Ei(V)  =  0,  1  <  i  <  n. 
u^Uo 


(5.4.2) 


From  Schwarz's  inequality, 

\gi(V)  -gi(V0)\  <  {Mi  +  |£,-(U)|)|U-Uo| 


where 


Therefore, 


From  (5.4.2), 


1/2 


|G(U)-G(U0)| 
|U-Uo| 


<  I)2 


V;=l 


1/2 


1/2 


1/2 


Jim  J2(M<  + 1 E<  cu)D2     =  I E M" 


V(=l 


V;=l 


which  implies  the  conclusion. 

The  following  theorem  is  analogous  to  Theorem  2.3.5. 


U 


Theorem  5.4.3  (The  Chain  Rule)  Suppose  that  the  real-valued  function  f  is 
differentiable  at  Xo  in  W ,  the  vector-valued  function  G  =  (g\ ,  gi,  ■  ■  ■ ,  gn)  is  differentiable 
flfUo  inRm,  andXo  =  G(Uo).  Then  the  real-valued  composite  function  h  =  f  oG  defined 
by 

h(U)  =  /(G(U))  (5.4.3) 

is  differentiable  at  Uo,  and 


dv0h  =  fXl(X0)dVog1  +  fX2(X0)dVog2  H  h  fXn(X0)dVog„. 


(5.4.4) 


Proof  We  leave  it  to  you  to  show  that  Uo  is  an  interior  point  of  the  domain  of  h  (Exer- 
cise 5.4.1),  so  it  is  legitimate  to  ask  if  h  is  differentiable  at  Uo- 

Let  Xo  =  {x\o,  x2o,  •  •  • ,  x„o).  Note  that 

Xio  =  gi(Uo),     1  <  /  <  n, 
by  assumption.  Since  /  is  differentiable  at  Xo,  Lemma  5.3.8  implies  that 

n 

f(X)  -  /(Xo)  =       A/  (X°X*''  -  *o)  +  E(X)\X  -  X0|.  (5.4.5) 


7=1 


where 


lim  E(X)  =  0. 
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Substituting X  =  G(U)  and  X0  =  G(U0)  in  (5.4.5)  and  recalling  (5.4.3)  yields 

n 

h(U)-h(Vo)  =       A,.(X0)fe  (U)  -gi(Vo))  +  £(G(U))|G(U)  -G(U0)|.  (5.4.6) 

(=i 

Substituting  (5.4.1)  into  (5.4.6)  yields 


h(U)  -  h(V0)  =  J2f*i  (Xo)(rfu0£/)(U  -  Uo)  +  (  J2  fxi  (Xo)£/  (U) )  |U  -  U0| 

i  =  l 

+  £(G(U))|G(U)-G(U0|. 


V;=l 


Since 


lim  £(G(U))  =   lim  E(X)  =  0, 


(5.4.2)  and  Lemma  5.4.2  imply  that 

n 

h(V)  -  h(V0)  -      fxi  (Xodvogi  (U  -  Uo) 


'  =  '   =0. 


|U-Uo| 

Therefore,  h  is  differentiable  at  Uo,  and  d\jQh  is  given  by  (5.4.4).  TJ 
Example  5.4.1  Let 

f(x,y,z)  =  2x2  +  4xy  +  3yz, 
gi(u,v)  =  u2  +  v2,    g2(u,v)  =  u2 —2v2,    gj(u,v)  =  uv, 

and 

h(u,  v)  =  f(gi(u,  v),  g2(u.  v),  g3(u,  v)). 

LetUo  =  (1,-1)  and 

X0  =  (gi(U0),g2(Uo),?3(Uo))  =  (2,-1,-1). 

Then 

A(X0)  =  4,    fy(X0)  =  5,    A(X0)  =  -3, 

3gi(Uo)  _  2     3gi(Uo)  _  2 

du  dv 

9g2(Uo)  _  2     9g2(Uo)  _  4 

du  dv 

9g3(Uo)  _  1      9g3(Uo)  _  j 

9w  3d 

Therefore, 

d\jQg\  =  2du  —  2dv,    d\j0g2  =  2du  +  Adv,    d\jQg3  = —du  +  dv, 
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and,  from  (5.4.4), 

dvQh  =  fx(X0)  dVogi  +  fy(X-o)  dVog2  +  /Z(X0)  dVog3 
=  4(2  du  —  2  dv)  +  5(2  du  +  4  dv)  —  3(—du  +  dv) 
=  21  du  +  9dv. 

Since 

dVoh  =  /zM(Uo)  du  +  hv(U0)  dv 

we  conclude  that 

MUo)  =  21    and    hv(U0)  =  9.  (5.4.7) 
This  can  also  be  obtained  by  writing  h  explicitly  in  terms  of  (u,  v)  and  differentiating;  thus, 

h(u,  v)  =  2[gi(u,  v)]2  +  4gi(u,  v)g2(u,  v)  +  3g2(u,  v)g3(u,  v) 

=  2(u2  +  v2)2  +  4(u2  +  v2)(u2  -  2v2)  +  3(u2  -  2v2)uv 

=  6u4  +  3u3v  — 6uv3  — 6v4. 

Hence, 

hu(u,v)  =  24w3  +  9u2v  -  6v3    and     hv(u,  v)  =  3u3  -  I8uv2  -  24u3, 
soA„(l,-l)  =  21  andhv(l,-Y)  =  9,  consistent  with  (5.4.7). 

Corollary  5.4.4  Under  the  assumptions  of  Theorem  5.4.3, 

9KUo)      A  3AXo)^(Uo)_  l<i<m_ 

Proof  Substituting 

9gf(U0)  3gl-(U0)  ,  ,  dgi(Vo)  j 

"v0gi  =  — «  »mi  H  a«2  H  1  ^  dum,     1  <  i  <n, 

au\  ou2  dum 

into  (5.4.4)  and  collecting  multipliers  of  du\,  du2,  •  •  • ,  dum  yields 
a   h     A^g/go)  B8j(V0)\ 


However,  from  Theorem  5.3.6, 


"  3A(D0)  , 

"u0«  =  2^  — 5  dui- 

i=i  dUi 

Comparing  the  last  two  equations  yields  (5.4.8).  H 
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When  it  is  not  important  to  emphasize  the  particular  point  Xo,  we  write  (5.4.8)  less 
formally  as 

dh  df  9j?/ 

r=V^  l<i<m,  (5.4.9) 
dUi      ™  ox/  out 

with  the  understanding  that  in  calculating  3/!(Uo)/3w;,  dgj/diii  is  evaluated  at  Uo  and 
df/dxj  atX0  =  G(U0). 

The  formulas  (5.4.8)  and  (5.4.9)  can  also  be  simplified  by  replacing  the  symbol  G  with 
X  =  X(U);  then  we  write 

h(U)  =  /(X(U)) 


and 


or  simply 


3/»(Uo)  _  y  df(X0)  dxj(V0) 

dui        ^-J    dxj  dui 


dh      ^-v  3/  3^/ 


3l// 


^  9x;  3m;-  *~  ^ 

i  =  l  y 


Example  5.4.2  Let  (r,  (9)  be  polar  coordinates  in  the  xy-p\ane;  that  is, 

x  =  r  cos  0,     j  =  r  sin  0. 
Suppose  that  /  =  f(x,  y)  is  differentiable  on  a  set  5,  and  let 

h(r,6)  =  f(r  cos#,  rsin#). 
If  (r  cos  9,  r  sin9)  e  5,  (5.4.10)  implies  that 

dh      df  dx     df  dy  Jf  Jf 

it  =  ^~^r  +  -ir4-  =  cos  6  tt- +  sin  9  4-  (5.4.11) 

dr      dx  dr      dy  dr  dx  dy 


and 


dh      dfdx     dfdy  .     df  df 

Y9  =  TxT9+TyT9=  -r*m9Tx  +rcos%> 


where  fx  and  fy  are  evaluated  at  (x,  y)  =  (r  cos  9,  r  sin  9).  ■ 

The  proof  of  Corollary  5.4.4  suggests  a  straightforward  way  to  calculate  the  partial 
derivatives  of  a  composite  function  without  using  (5.4.10)  explicitly.  If  h(XJ)  =  f  (X(U)), 
then  Theorem  5.4.3  ,  in  the  more  casual  notation  introduced  before  Example  5.4.2,  implies 
that 

dh  =  fxidxi  +  fX2dx2-\  h  fx„dxn,  (5.4.12) 

where  dx\,  dx2,  dxn  must  be  written  in  terms  of  the  differentials  du\,  dii-i,  ■  ■  . ,  dum 
of  the  independent  variables;  thus, 
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,         dxi   ,         dxi   ,  dxj  , 

dxi  =  - —  du\  +  - —  du2  H  1-  -r —  dum. 

du\  0U2  oum 

Substituting  this  into  (5.4.12)  and  collecting  the  multipliers  of  du\,  du-i,  ■  ■  ■ ,  dum  yields  (5.4.10). 
Example  5.4.3  If 


then 


But 


hence, 


h(r,6,z)  =  f(x(r,  9),y(r,0),z), 

dh  =  fx  dx  +  fy  dy  +  fz  dz. 

dx  dx  dy  dy 

dx  =  —  dr  +  —  dd    and    dy  =  —  dr  +  —  dd; 
dr  d9  dr  d0 

dh  =  h    d' +  %  M) +  (I d' +  %  M) +  k  dz 


,  dx         dy      ,  dx  dy 


Example  5.4.4  Let 

h(x)  =       y  (x,  z(x))  ,  z(x)). 

Then 

dh  =  fx  dx  +  fydy  +  fz  dz,  (5.4.13) 

dy  =  yxdx  +  yz  dz,  (5.4.14) 

and 

dz=z'dx,  (5.4.15) 

where  the  prime  indicates  differentiation  with  respect  to  x.  Substituting  (5.4.15)  into 
(5.4.14)  yields 

dy  =  (yx  +  yzz')  dx 

and  substituting  this  and  (5.4.15)  into  (5.4.13)  yields 

dh  =  [fx  +  fy(yx  +  yzz.')  +  fzz']  dx; 

hence, 

ti  =  fx  +  fy(yx  +  yzz')  +  fzt'. 

Here  fx,  fy,  and  fz  are  evaluated  at  (x,  y(x,  z(x)),  z(x)),  yx  and  yz  are  evaluated  at 
(x,  z{x)),  and  z!  is  evaluated  at  x. 
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Higher  Derivatives  of  Composite  Functions 

Higher  derivatives  of  composite  functions  can  be  computed  by  repeatedly  applying  the 
chain  rule.  For  example,  differentiating  (5.4.10)  with  respect  to  uk  yields 

d2h     _  A    3    /  df  dx  j  \ 
dukdui      ^-J  duk  \dxj  dui  ) 

A  df  dzxj  A  dxj_J_  (  df_ 
^-J  dxj  dukduj  ^— J  dui  duk  \dxj 


J 

We  must  be  careful  finding 


)uk  \dxj  J 


dXj 


then  (5.4.17)  becomes 


Since 


this  yields 


9g(X(U))  _  A  3g(X(U))  dxs(U) 
duk  3xs  duk 


d2f 


dxs      dxs  dxj  ' 

d2f  dxs 


)uk  \dxk) 


duk  \dxk  J      f-f  dxsdxj  duk 
Substituting  this  into  (5.4.16)  yields 

d2h         A  df    d2xj        A  dxj  A    d2f  3 


(5.4.16) 


which  really  stands  here  for 

3  /3/(X(U))\ 
The  safest  procedure  is  to  write  temporarily 

9/(X) 


duk  duj      '—'  dx  j  duk  dui      ^— '  3w;  ^— '  3x5  9x » 

'        j  =  l       3       K      1       j  =  l       '  s  =  \       s  ■' 

To  compute  hu.Uk(Vo)  from  this  formula,  we  evaluate  the  partial  derivatives  of  x\,  x%, 
x„  at  Uo  and  those  of  /  at  Xo  =  X(Uo).  The  formula  is  valid  if  X\,  X2,  ■  ■  ■ ,  xn  and 
their  first  partial  derivatives  are  differentiable  at  Uo  and  /,  fx. ,  fX2,       fXn  and  their  first 
partial  derivatives  are  differentiable  at  Xo. 

Instead  of  memorizing  (5.4.18),  you  should  understand  how  it  is  derived  and  use  the 
method,  rather  than  the  formula,  when  calculating  second  partial  derivatives  of  composite 
functions.  The  same  method  applies  to  the  calculation  of  higher  derivatives. 
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Example  5.4.5  Suppose  that  fx  and  fy  in  Example  5.4.2  are  differentiable  on  an  open 
set  S  inR2.  Differentiating  (5.4.1 1)  with  respect  to  r  yields 

d2h  -COS0  9  (df\  sin6)  9  (df\ 
dr2  dr  \dx  )  dr  \dy  ) 

-coze(d2fdx  i  92/  dy\  i  zmei  d2f  dx  i  d2fdy\ 

\  dx2  dr      dy  dx  dr  J  \dx  dy  dr      dy2  dr ) 


if  (x,  y)  €  5.  Since 


9jc  „      dy  n  d2f  d2f 

=  cos  9,     —  =  sin  9,  and 


— —  —  lus  u,  — —  —  sin  i/.      aiiu      —  —  —  — — - — 

dr  dr  dx  dy      dy  dx 

if  (x,  y)€  S  (Exercise  5.3.21),  (5.4.19)  yields 

=  cos2  0—4  +2  sin  9  cos  9  —4-  +  sin2  9— 4r. 
dr2  dx2  dx  dy  dy2 

Differentiating  (5.4.1 1)  with  respect  to  9  yields 

d2h  Jf  Jf  „  3  (9f\  „  d  (df\ 

-sin^—  +  cos  6—  +  cos  6—    —    +  sin  9—  — 
dx  dy  d9  V  dx  J  d9  V  dy  J 

Jf  Jf  „  /32/  9JC      32/  dy  ' 

l9dx-+COS%+COSe{jx^d9+Wrxd9, 
(  /  32/  dx     d2f  dy\ 
' \dx  dyd9  +  Jy^WJ 

Since 


d9dr 

=  —  sin  f 

+  sin 


dx      ■  a    a  dy 

—  =  —  r  sin  9    and    —  =  r  cos 

d9  d9 


it  follows  that 


d2h 

=  —  sin 


d9dr 


1  h  cos  9  —  r  sin  9  cos  9  I  — —  — -  I 

dx  dy  V  9y  / 


32  f 

+r(cos26»-sin26»)— 

dxdy 


(5.4.19) 


The  Mean  Value  Theorem 

For  a  composite  function  of  the  form 

h(t)  =  f(x1(t),x2(t),...,x„(t)) 

where  t  is  a  real  variable,  x\,  x2,  ■  ■  ■ ,  xn  are  differentiable  at  fo,  and  /  is  differentiable  at 
X0  =  X(r0),  (5.4.8)  takes  the  form 

n 

h'(t0)  =       h j  mo))x'j  (t0).  (5.4.20) 
This  will  be  useful  in  the  proof  of  the  following  theorem. 
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Theorem  5.4.5  (Mean  Value  Theorem  for  Functions  of  n  Variables) 

Let  f  be  continuous  at Xi  =  (xn,  .T21. . . . ,  xn\)  andX2  =  (xn,  X22,  •  •  ■ ,  Xni)  and  dif- 
ferentiable  on  the  line  segment  L  from  Xi  to  X2.  Then 

n 

f(X2)-f(X1)  =  YJfxi^o){xi2-xil)  =  (Jxo/)(X2-X!)  (5.4.21) 
i=l 

for  some  Xo  on  L  distinct  from  Xi  and  X2. 

Proof   An  equation  of  L  is 

X  =  X(0  =  rX2  +  (l-r)Xi,  0<r<l. 
Our  hypotheses  imply  that  the  function 

h(t)  =  f(X(t)) 
is  continuous  on  [0,  1]  and  differentiable  on  (0,  1).  Since 

xi(t)  =  txi2  +  (1  -  t)xn, 

(5.4.20)  implies  that 

n 

h'(t)  =       fxt  (X(0)(**2  -  xn),    0  <  t  <  1. 
i=i 

From  the  mean  value  theorem  for  functions  of  one  variable  (Theorem  2.3.1 1), 

h(l)-h(0)  =h'(to) 

for  some  t0  e  (0,  1).  Since  /z(l)  =  /(X2)  and  h(0)  =  /(Xi),  this  implies  (5.4.21)  with 
X0  =  X(f0).  H 

Corollary  5.4.6  If  fxi ,  fX2,  ... ,  fXn  are  identically  zero  in  an  open  region  S  of  MP, 
then  f  is  constant  in  S. 

Proof   We  will  show  that  if  Xo  and  X  are  in  S,  then  / (X)  =  / (Xo).  Since  S  is  an  open 
region,  S  is  polygonally  connected  (Theorem  5.1.20).  Therefore,  there  are  points 

Xo,  Xi, . . . ,  X„  =  X 

such  that  the  line  segment  L,-  from  X;-i  to  X;  is  in  S,  1  <  i  <  n.  From  Theorem  5.4.5, 

n 

f(Xt)  -  /(X^)  =  -x,_o, 

i=\ 

where  X  is  on  L,  and  therefore  in  5.  Therefore, 

fxt  (% )  =  fX2  (%)  =  ■  ■  ■  =  fXn  (% )  =  0, 
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which  means  that  erg.  f  =  0.  Hence, 

/(X0)  =  /(X1)  =  ---  =  /(X„); 
that  is,  /(X)  =  /(Xo)  for  every  X  in  S.  H 

Higher  Differentials  and  Taylor's  Theorem 

Suppose  that  /  is  defined  in  an  n-ball  Bp(Xq),  with  p  >  0.  If  X  e  Bp(Xq),  then 
X(0  =X0  +  r(X-X0)  eBp(X),    0  <  r  <  1, 

so  the  function 

h(t)  =  /(X(r)) 

is  defined  forO  <  t  <  1.  From  Theorem  5.4.3  (see  also  (5.4.20)), 


A'(0=£/*,(X(0(*/-*to) 


if  /  is  differentiable  in  Bp(Xo),  and 

"  92/(X(Q)/ 
(J=l 

if  /X1 ,  /X2,  . . . ,       are  differentiable  in  Bp(Xq).  Continuing  in  this  way,  we  see  that 
h{r\t)=  a — 9  /(x(0) — (xh-Xiuo)(xi2-Xi2)o)---(xir-Xir>o)  (5.4.22) 

if  all  partial  derivatives  of  /  of  order  <  r  —  1  are  differentiable  in  fip(Xo). 
This  motivates  the  following  definition. 

Definition  5.4.7  Suppose  that  r  >  1  and  all  partial  derivatives  of  /  of  order  <  r  —  1 
are  differentiable  in  a  neighborhood  of  Xo.  Then  the  rth  differential  of  f  at  Xo,  denoted 
by  d^'f,  is  defined  by 

4?/=      Yl      a,  l.^.^v.  dxhdxi2---dxh,  (5.4.23) 


  -9a- 

11,12,-,' 


!  _.  3xirdxir_1  ■  ■  ■  oa; , 
— i 


where  Jxi,  J^2,  ■  ■  ■ ,  are  the  differentials  introduced  in  Section  5.3;  that  is,  dxi  is  the 
function  whose  value  at  a  point  in  R"  is  the  2  th  coordinate  of  the  point.  For  convenience, 
we  define 


(<?/)  =  /(Xo). 


Notice  that  d£  f  =  dx0f- 
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Under  the  assumptions  of  Definition  5.4.7,  the  value  of 

9r/(X0) 
dxi,,dxir_1  ■■■dxh 

depends  only  on  the  number  of  times  /  is  differentiated  with  respect  to  each  variable, 
and  not  on  the  order  in  which  the  differentiations  are  performed  (Exercise  5.3.22).  Hence, 
Exercise  5.3.12  implies  that  (5.4.23)  can  be  rewritten  as 

<,'/  -  E  „  ,J'. .  r. ,  <#*r  (far  •  ■  •  (fe,)-  ■  (5-4.24, 

where  ^r  indicates  summation  over  all  ordered  H-tuples  (r\,  Ti, . .. ,  r„)  of  nonnegative 
integers  such  that 

r\  +  r2  H  Yrn  =  r 

and  9x;'  is  omitted  from  the  "denominators"  of  all  terms  in  (5.4.24)  for  which  r,  =  0.  In 
particular,  if  n  =  2, 

Example  5.4.6  Let 

=  i  i  n— > 

1  +  ax  +  by 

where  a  and  b  are  constants.  Then 

9' -(-D-H-  fl7y"7' 


so 


9x-/'  3yr--^'  '  (1  +  ax  +  by)r+l 


A°         (1  +  ax0  +  by0)r+1  f^Q\J  ) 


 — - — -  —(a  dx  +  b  dyY 

(l+ax0  +  by0y+iy  y' 


if  1  +  ax0  +  byo  ^  0. 
Example  5.4.7  Let 


f(X)  =  exp  j  -J2aJxJ 


where  a\,  a-i,  ■  ■  . ,  an  are  constants.  Then 
3V(X) 


dxn  dxn  . . .  dxrn  =  «?  ■  ■  ■  <"  exP  (  -  E  <*jXj  ]  • 

12  I 
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Therefore, 


(4?/)(<I>)  =  (-l)r  (E-i-r  :ar1lar22---arn"(dx1yi(dx2)r2...(dxny») 

(  f 

x  exp  I  —  >  ajXji 


7=1 

=  (—  l)r(ai  dx\  +  a2  dx2  +  ■  ■  ■  +  an  dxn)r  exp  I  —  cijXjo 

V  ./=i 

(Exercise  5.3.12).  ■ 

The  next  theorem  is  analogous  to  Taylor's  theorem  for  functions  of  one  variable  (Theo- 
rem 2.5.4). 

Theorem  5.4.8  (Taylor's  Theorem  for  Functions  of  n  Variables)  Suppose 
that  f  and  its  partial  derivatives  of  order  <  k  are  differentiable  at  Xo  and  X  in  W  and  on 
the  line  segment  L  connecting  them.  Then 

f&>  =  E  ^(<)/)(X  -X)  +  _!_(4*+1>/)(X-Xo)  (5.4.25) 

r=0  ' 

for  some  Xon  L  distinct  from  Xo  and  X. 
Proof  Define 

h(t)  =  /(Xo  +  t(X  -  X0)).  (5.4.26) 

With  $  =  X  —  Xo,  our  assumptions  and  the  discussion  preceding  Definition  5.4.7  imply 
that  h,  h',  . . . ,  hS  +r>  exist  on  [0,  1].  From  Taylor's  theorem  for  functions  of  one  variable, 

*  A«(0)  /z^+1)(r) 

for  some  r  e  (0,  1).  From  (5.4.26), 

h(0)  =  /(X0)    and    h(l)  =  /(X).  (5.4.28) 
From  (5.4.22)  and  (5.4.23)  with  $  =  X  -  X0, 

/i(r)(0)  =  (4r0)/)(X  -  Xo),     l<r<k,  (5.4.29) 

and 

A(*+D(T)  =  (4+1/)  (X-X0)  (5.4.30) 
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where 

X  =  X0  +  r(X  -  X0) 

is  on  L  and  distinct  from  X0  and  X.  Substituting  (5.4.28),  (5.4.29),  and  (5.4.30)  into 
(5.4.27)  yields  (5.4.25).  H 

Example  5.4.8  Theorem  5.4.8  and  the  results  of  Example  5.4.6  with  X0  =  (0,  0)  and 
$  =  (x,  y)  imply  that  if  1  +  ax  +  by  >  0,  then 


1 


=  £(-l)>.v +6}')' +  (-!)*+' 


1+ax  +  by      f^Q  (1  +  arx  +  bxy)k+2 

for  some  r  e  (0,  1).  (Note  that  r  depends  on  k  as  well  as  (x,  y).) 

Example  5.4.9  Theorem  5.4.8  and  the  results  of  Example  5.4.7  with  Xo  =  0  and 
$  =  X  imply  that 

"         \       k  (-\y 
exp  I  _  ^2aJxJ  I   — +  a2x2  H  1-  a„x„)r 

j  =  \  J       r=0  r- 

(-l)fc+1 

+      ,  ,.,(aixi  +  a2x2  H  h  anxn)k+l 

(k  +  1)! 

x  exp 


n 


T 


for  some  r  £  (0, 1), 


By  analogy  with  the  situation  for  functions  of  one  variable,  we  define  the  kth  Taylor 
polynomial  of  f  about  Xq  by 


71(X)  =  ^-(4;)/)(X-Xo) 


r=0 

if  the  differentials  exist;  then  (5.4.25)  can  be  rewritten  as 

/(X)  =  Tk(X)  +  —±—(af+1)f)(X-X0). 
(k  +  1)!  x 

A  Sufficient  Condition  for  Relative  Extreme  Values 

The  next  theorem  leads  to  a  useful  sufficient  condition  for  local  maxima  and  minima.  It 
is  related  to  Theorem  2.5.1.  Strictly  speaking,  however,  it  is  not  a  generalization  of  Theo- 
rem 2.5.1  (Exercise  5.4.18). 
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Theorem  5.4.9  Suppose  that  f  and  its  partial  derivatives  of  order  <  k  —  1  are  differ- 
entiable  in  a  neighborhood  N  of  a  point  Xo  in  M"  and  all  kth-order  partial  derivatives  of 
f  are  continuous  at Xq.  Then 


y  /(X)  ~  7fc(X) 

hm  — ■  -. —  =  0. 

x^xo  |X-X0|fc 


(5.4.31) 


Proof  If  e  >  0,  there  is  a  8  >  0  such  that  Bs(X0)  C  N  and  all  fcth-order  partial 
derivatives  of  /  satisfy  the  inequality 


3*/(X) 


3*/(Xo) 


<e,  Xe«j(X0). 


dxikdxik_l  ■  ■  ■  dxtl      dxikdxik_l  ■  ■  ■  dxtl 
Now  suppose  that  X  e  Bg(Xo).  From  Theorem  5.4.8  with  k  replaced  by  k  —  1, 


1 


/(X)  =  rfc_!(X)  +  -(4fc)/)(X  -  Xo), 


(5.4.32) 


(5.4.33) 


where  X  is  some  point  on  the  line  segment  from  Xo  to  X  and  is  therefore  in  Bg(Xo).  We 
can  rewrite  (5.4.33)  as 


/(X)  =  7}fc(X)  +  ^  [(dff)(X  -  Xo)  -  (<V)(X  -  Xo)]  . 
But  (5.4.23)  and  (5.4.32)  imply  that 


(5.4.34) 


(dff)(X  -  Xo)  -  (4oV)(X  -  Xo) 


<nke\X-X0\k 


(5.4.35) 


(Exercise  5.4.17),  which  implies  that 

|/(X)-7HX)|  ^ 

\X-X0\k  k\  SK  °; 

from  (5.4.34).  This  implies  (5.4.31). 

Let  r  be  a  positive  integer  and  Xq  =  (^io,  -^20,  •  ■  • ,  Xno)-  A  function  of  the  form 


U 


p(X)  =  J2anr2-rn(Xl-X1o)ri(X2-X2o)r2---(xn-Xn0Y",  (5.4.36) 


where  the  coefficients  {anr2...r„}  are  constants  and  the  summation  is  over  all  «-tuples  of 
nonnegative  integers  (ri,  r%, . . . ,  rn)  such  that 

r\  +  r2-\  \-rn  =  r, 

is  a  homogeneous  polynomial  of  degree  r  in  X  —  Xo,  provided  that  at  least  one  of  the 
coefficients  is  nonzero.  For  example,  if  /  satisfies  the  conditions  of  Definition  5.4.7,  then 
the  function 

p(X)  =  {d{£f)(X-X0) 
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is  such  a  polynomial  if  at  least  one  of  the  rth-order  mixed  partial  derivatives  of  /  at  Xo  is 
nonzero. 

Clearly,  p(Xo)  =  0  if  p  is  a  homogeneous  polynomial  of  degree  r  >  1  in  X  —  Xo. 
If  p(X)  >  0  for  all  X,  we  say  that  p  is  positive  semidefinite;  if  p(X)  >  0  except  when 
X  =  Xo,  p  is  positive  definite. 

Similarly,  p  is  negative  semidefinite  if  p(X)  <  0  or  negative  definite  if  p(X)  <  0  for  all 
X     Xo.  In  all  these  cases,  p  is  semidefinite. 

With  p  as  in  (5.4.36), 

p(-X  +  2X0)  =  (-1)>(X). 
so  p  cannot  be  semidefinite  if  r  is  odd. 

Example  5.4.10  The  polynomial 

p(x,  y,  z)  =  x2  +  y2  +  z2  +  xy  +  xz.  +  yz 
is  homogeneous  of  degree  2  in  X  =  (x,  y,  z).  We  can  rewrite  p  as 

p(x,  y,  z)  =  -  [(x  +  y)2  +  (y  +  z.)2  +  (z.  +  x)2] , 

so  p  is  nonnegative,  and  p(x,  ~y,  z)  =  0  if  and  only  if 

Y  +  7  =  7  +  r  =  z+  x  =  0, 

which  is  equivalent  to  (x,  ~y,  z)  =  (0,0,0).  Therefore,  p  is  positive  definite  and  —p  is 
negative  definite. 

The  polynomial 

pi(x,y,z.)  =  x2  +  y2  +  z.2  +  2xy 

can  be  rewritten  as 

pi(x,y,z.)  =  (x  +  y)2  +  z2, 

so  pi  is  nonnegative.  Since  pi (1,-1,0)  =  0,  p\  is  positive  semidefinite  and  —p\  is 
negative  semidefinite. 

The  polynomial 

p2(x,y,z)  =  x2  —  y2  +  z.2 
is  not  semidefinite,  since,  for  example, 

p2(l,0,0)  =  l    and    p2(0, 1,0)  =1.  ■ 

From  Theorem  5.3.1 1,  if  /  is  differentiable  and  attains  a  local  extreme  value  at  Xo,  then 

dxQf  =  0.  (5.4.37) 

since  fxi  (Xo)  =  fX2  (Xo)  =  •  •  •  =  fx„  (Xo)  =  0.  However,  the  converse  is  false.  The  next 
theorem  provides  a  method  for  deciding  whether  a  point  satisfying  (5.4.37)  is  an  extreme 
point.  It  is  related  to  Theorem  2.5.3. 
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Theorem  5.4.10  Suppose  that  f  satisfies  the  hypotheses  of  Theorem  5.4.9  with  k  > 
2,  and 

<V-0    (l<r<*-l),    <V#0.  (5.4.38) 

Then 

(a)  Xo  is  not  a  local  extreme  point  of  f  unless       f  is  semidefinite  as  a  polynomial  in 
X  —  Xo .  In  particular,  Xo  is  not  a  local  extreme  point  of  f  ifk  is  odd. 

(b)  Xo  is  a  local  minimum  point  of  f  if       f  is  positive  definite,  or  a  local  maximum 
point  if       f  is  negative  definite. 

(c)  If  d^  f  is  semidefinite,  then  Xo  may  be  a  local  extreme  point  of  f,  but  it  need  not 
be. 

Proof   From  (5.4.38)  and  Theorem  5.4.9, 

f(X)-f(X0)-^-(d^)(X-X0) 

lim   =  0.  (5.4.39) 

x^xo  |X-X0|fc 

If  X  =  Xo  +  t U,  where  U  is  a  constant  vector,  then 


so  (5.4.39)  implies  that 


(d^f)(X-X0)  =  tk(d^f)(U), 


fk 

1    ■  ,(k) 


/(Xo  +  fU)  -  /(Xo)  -  -«oV)(U) 

lim  =  ^   =  0. 

t^O  tk 

or,  equivalently, 

lim  /<*>  +  *;>-/(*>>  =  1  idg  f  W)  (5.4.40) 

t^O  tk  k\  X° 

for  any  constant  vector  U. 

To  prove  (a),  suppose  that  f  is  not  semidefinite.  Then  there  are  vectors  Ui  and  U2 
such  that 

(4o}/)(Ui)>0    and  (4^/)(U2)<0. 
This  and  (5.4.40)  imply  that 

/(X0  +  rtJi)  >  /(X0)    and    /(X0  +  fU2)  <  /(X„) 

for  f  sufficiently  small.  Hence,  Xo  is  not  a  local  extreme  point  of  /. 

To  prove  (b),  first  assume  that  d^  f  is  positive  definite.  Then  it  can  be  shown  that 
there  is  a  p  >  0  such  that 

(4fc)/)(X-X0) 

K-*ilA_   -  Plx_xol  (5.4.41) 

k\ 
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for  all  X  (Exercise  5.4.19).  From  (5.4.39),  there  is  a  S  >  0  such  that 
/(X)  -  /(X0)  -  1       f)(X  -  X0) 

  fc!     0  if  |X-X0|<«. 

\X-X0\k  2  1  1 

Therefore, 

/(X)-/(Xo)>I(4^)(X-Xo)-^|X-X0|*    if  |X-X0|<«. 

This  and  (5.4.41)  imply  that 

/(X)-/(X0)>^|X-X0|fc    if  |X-X„|<S. 

which  implies  that  Xo  is  a  local  minimum  point  of  /.  This  proves  half  of  (b).  We  leave 
the  other  half  to  you  (Exercise  5.4.20). 

To  prove  (c)  merely  requires  examples;  see  Exercise  5.4.21.  TJ 

Corollary  5.4.11  Suppose  that  f,  fx,  and  fy  are  differentiable  in  a  neigborhood  of  a 
critical  point  Xq  =  (xo,  yo)  of  f  and  fxx,  fyy,and  fxy  are  continuous  at  (xo,  y  a).  Let 

D  =  fxx{xo,  yo)fxy(x0,  y0)  -  fxy(x0,  yo). 

Then 

(a)  (xo,  yo)  is  a  local  extreme  point  of  f  if  D  >  0;  (xo,  yo)  is  a  local  minimum  point  if 
fxx(xo,  yo)  >  0,  or  a  local  maximum  point  if  fxxixo,  yo)  <  0. 

(b)  (xo,  yo)  is  not  a  local  extreme  point  of  f  if  D  <  0. 

Proof   Write  {x  —  xo,  y  —  yo)  =  (m,  v)  and 

p(u,  v)  =  (d^f)(u,  v)  =  Au2  +  2Buv  +  Cv2, 
where  A  =  fxx(xo,  yo),  B  =  fxy(x0,  yo),  and  C  =  fyy(x0,  yo),  so 

D  =  AC  -B2. 
If  D  >  0,  then  A  ^  0,  and  we  can  write 

p(u,  v)  =  A  I  u2  +  —uv  +  —  v2  J  +  I  C  -  —  I  v2 

J  B    V        D  2 

=  A{U+AV)  +AV- 

This  cannot  vanish  unless  u  =  v  =  0.  Hence,  d^'f  is  positive  definite  \f  A  >  0  or 
negative  definite  if  A  <  0,  and  Theorem  5.4.10(b)  implies  (a). 
If  D  <  0,  there  are  three  possibilities: 
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1.  A  +  0;  then  p(l,  0)  =  A  and  /?  f— ^,  1  j  =  ^. 

2.  C  ^  0;  then  /?(0,  1)  =  C  and    M ,  ~J  =  ^. 


3.  .4  =  C  =  0;  then  5  ^  0  and  /?(!,  1)  =  25  and  p(l,-l)  =  -2B. 


In  each  case  the  two  given  values  of  p  differ  in  sign,  so  Xo  is  not  a  local  extreme  point 
of  /,  from  Theorem  5.4.10(a).  H 

Example  5.4.11  If 

f(x,y)  =  eax2+by2, 

then 

fx(x,  y)  =  2axf(x,  y),    fy(x,  y)  =  2byf(x,  y), 

so 

/*(D,0)  =  /„(0,0)  =  0, 
and  (0,  0)  is  a  critical  point  of  /.  To  apply  Corollary  5.4.1 1,  we  calculate 

fxx(x,  y)  =  (2a  +  4a2x2)f(x,  y), 

fyy(x,y)=(2b  +  4b2y2)f(x,y), 

fxy(x,y)  =  Aabxyf(x,y). 

Therefore, 

D  =  fxx(0, 0)fyy(0, 0)  -  f2y(0. 0)  =  (2o)(26)  -  (0)(0)  =  4ab. 

Corollary  5.4. 1 1  implies  that  (0,  0)  is  a  local  minimum  point  if  a  and  b  are  positive,  a  local 
maximum  if  a  and  b  are  negative,  and  neither  if  one  is  positive  and  the  other  is  negative. 
Corollary  5.4.1 1  does  not  apply  if  a  or  b  is  zero. 


5.4  Exercises 


In  the  exercises  on  the  use  of  the  chain  rule,  assume  that  the  functions  satisfy  appropriate 
differentiability  conditions. 

1.     Under  the  assumptions  of  Theorem  5.4.3,  show  that  Uo  is  an  interior  point  of  the 
domain  of  h . 
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2.  Let  h(\J)  =  /(G(U))  and  find  d\]0h  by  Theorem  5.4.3,  and  then  by  writing  h 
explicitly  as  a  function  of  U. 

(a)  f(x,  y)  =  3x2  +  Axy2  +  3x, 

gi(u,v)  =  veu+v-\  («o,  «o)  =  (0, 1) 

g2(u,v)  =  e~u+v-\ 

(b)  f(x,y,z)  =  e-(x+y+z), 

gi(u,V,w)  =   logM-logU  +  logW,  ...  \          /i    ,  n 

g2(u,v,w)  =  —  21ogw  —  31ogw;, 

gj,{u,V,U))  =  log  U  +  log  V  +  2  log  U! , 

(c)  =  (*  +  y)2, 

gi(u,v)  =  wcostj,      (mo,  Do)  =  (3,  tt/2) 
g"2(":  v)  =  u  sin  u, 

(d)  f(x,y,z)  =  x2  +  y2  +  z.2, 

gi(u,v,w)  =  w  cost;  sin  uj, 

;         :  (uo,  u0,  wo)  =  (4, 7T/3,  7T/6) 

g2\Mi  V,W)  =  U  COS  I?  COS  If, 

g3(w,  u,  w)  =  m  sin  u; 

3.  Let  /j(r,  0,  z)  =  f(x,  y,  z),  where  x  =  r  cos  0  and  y  =  r  sin  0.  Find  hr,  hg,  and 
/jz  in  terms  of      /y,  and  fz. 

4.  Let /z(r,  (9,  0)  =  f(x,  y,  z),  where  x  =  r  sin^  cos  (9,  j  =  r  sin  0  sin  (9,  and  z  = 
r  cos  0.  Find  hr,  hg,  and  /z^,  in  terms  of  /x, and  /z. 

5.  Prove: 

(a)  If  h(u,  v)  =  f  (u2  +  v2),  then  vhu  —  uhv  =  0. 

(b)  If  h(u,v)  =  /(sin u  +  cos  i>),  then  hu  sin  v  +  hv  cos  u  =  0. 

(c)  If  /z(w,  u)  =  f(u/v),  then  m/zm  +  u/zu  =  0. 

(d)  If  h(u,  v)  =  f(g(u,  v),  -g(u,  i>)),  then  dh  =  (fx  -  fy)  dg. 

6.  Find  hy  and  hz  if 

h(y,  z.)  =  g(x(y,  z),  y,  z,  w{y,  z)). 

7.  Suppose  that  w,  v,  and  /  are  defined  on  (—00,  00).  Let  u  and  v  be  differentiable 
and  /  be  continuous  for  all  x.  Show  that 

L  fV0°  f{t)  dt  =  f(v(x))v'(x)  -  f(u(x))u'(x). 

8.  We  say  that  /  =  f(x\,  X2, . . . ,  x„)  is  homogeneous  of  degree  r  if  D  f  is  open  and 
there  is  a  constant  r  such  that 


f{tx\,tx2, txn)  =  tr  f(xi,x2,  ■■■  ,x„) 
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whenever  t  >  0  and  (xi,  x%, . . .  ,xn)  and  (tx\ ,  tX2,  ■  ■  ■ ,  tx„)  are  in  D  f .  Prove:  If 
/  is  differentiable  and  homogeneous  of  degree  r,  then 

77 

^Xi  fXiiXi,Xz,  . . .  ,X„)  =  rf(Xi,X2,...,Xn). 

!  =  1 

(This  is  Euler's  theorem  for  homogeneous  functions.) 
9.    If  h(r,  9)  =  fir  cos  9,  r  sin  9),  show  that 

fxx  +  fyy  =  hrr  +  -hr  +  —^IQQ. 

Hint:  Rewrite  the  defining  equationas  f  (x,  y)  =  h(r(x,  y),  9(x,  y)),  withr(x,  y)  = 
y/ x2  +  y2  and  9(x,  y)  =  ian~l{y/x),  and  differentiate  with  respect  to  x  andy. 

10.  Let  h(u,  v)  =  f(a(u,  v),  b(u,  v)),  where  au  =  bv  and  av  =  —bu.  Show  that 

huu  +  hvv  =  (fxx  +  fyy){al  +  a2v). 

11.  Prove:  If 

u(x,t)  =  f(x  —  ct)  +  g(x  +  ct), 

then  utt  =  c2uxx. 

12.  Let  h(u,  v)  =  f(u  +  v,u  —  v).  Show  that 

(a)  fxx  -  fyy  =  huv     (b)  fxx  +  fyy  =  ^(huu  +  hvv) 

13.  Returning  to  Exercise  5.4.4,  find  hrr  and  hrg  in  terms  of  the  partial  derivatives  of 
/• 

14.  Let  huv  =  0  for  all  (m,  v).  Show  that  h  is  of  the  form 

h(u,  v)  =  U(u)  +  V{v). 
Use  this  and  Exercise  5.4.12(a)  to  show  that  if  fxx  —  fyy  =  0  for  all  (x,  y),  then 
f(x,y)  =  U(x  +  y)  +  V(x-y). 

15.  Prove  or  give  a  counterexample:  If  /  is  differentiable  and  fx  =  0  in  a  region  D, 
then  f(xi,  y)  =  f(xz,  y)  whenever  (xi,  y)  and  (x^,  y)  are  in  D;  that  is  f(x,  y) 
depends  only  on  y. 

16.  Findr3(X). 

(a)  f{x,  y)  =  ex  cos  y,    X0  =  (0, 0) 

(b)  fix,y)  =  e-x~y,   Xo  =  (0,0) 

(c)  fix,y,z)=  (x  +  y  +  z-3)\    X0  =  (1, 1, 1) 

(d)  fix,  y,  z)  =  sinx  siny  sinz,    Xo  =  (0,  0,  0) 

17.  Use  Eqns.  (5.4.23)  and  (5.4.32)  to  prove  Eqn.  (5.4.35). 
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18.  Carefully  explain  why  Theorem  5.4.9  is  not  a  generalization  of  Theorem  2.5. 1 . 

19.  Suppose  that  p  is  a  homogeneous  polynomial  of  degree  r  in  Y  and  p(Y)  >  0  for  all 
nonzero  Y  in  W.  Show  that  there  is  a  p  >  0  such  that  p(Y)  >  p|Y|r  for  all  Y  in 
W .  Hint:  p  assumes  a  minimum  on  the  set  {Y  |  |  Y|  =  1 }  .  Use  this  to  establish  the 
inequality  inEqn.  (5.4.41). 

20.  Complete  the  proof  of  Theorem  5.4.10(b). 

21.  (a)    Show  that  (0,  0)  is  a  critical  point  of  each  of  the  following  functions,  and  that 

they  have  positive  semidefmite  second  differentials  at  (0.  0). 

p(x,  y)  =  x2  -  2xy  +  y2  +  x4  +  y4; 
q(x,  y)  =  x2  —  2xy  +  y2  —  x4  —  y4 . 

(b)  Show  that  D  as  defined  in  Corollary  5.4.1 1  is  zero  for  both  p  and  q. 

(c)  Show  that  (0,  0)  is  a  local  minimum  point  of  p  but  not  a  local  extreme  point 
of  q. 

22.  Suppose  that  p  =  p(x\,  %i, . . . ,  xn)  is  a  homogeneous  polynomial  of  degree  r 
(Exercise  5.4.8).  Let  i\,  i%,  ...,/„  be  nonnegative  integers  such  that 

h  +  iz  H  1-  in  =  k, 

and  let 

dkp(x1,x2,  ...,x„) 

q(Xl,X2,  ■  ■  ■  ,  Xn)  =   :  :  :  • 

Sx'j1  dx1^  •  ■  ■  dxln" 

Show  that  q  is  homogeneous  of  degree  <  r  —  k,  subject  to  the  convention  that  a 
homogeneous  polynomial  of  negative  degree  is  identically  zero. 

23.  Suppose  that  /  =  f(x\,  x2, xn)  is  a  homogeous  function  of  degree  r  (Exer- 
cise 8),  with  mixed  partial  derivative  of  all  orders.  Show  that 

A  d2f(xl,x2,...,xn) 

>    XiXj  —  =  r(r  -  l)f(xi,x2,...,xn) 

t—1  axi  ox  j 

and 

A  33(^i,x2.  ...,xn)        .      ru      ....  , 

>     xtXjXk  5—5 — 5  =  r(r  -  l)(r  -  2)f(x\,x2,...,xn). 

.  TT* ,  oxi  dx  j  dxk 

Can  you  generalize  these  results? 

24.  Obtain  the  result  in  Example  5.4.7  by  writing 

F(X)  =  e-"^ie-"2X2...e-a„x„^ 

formally  multiplying  the  series 

e-atx,  =  J^i-iy^-,  l<i<n 


together,  and  collecting  the  resulting  products  appropriately. 
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25.  Let 

f(x,y)  =  ex+y. 

By  writing 


f(x,y)  =  2^ — — 


r=0 

and  expanding  (x  +  y)r  by  means  of  the  binomial  theorem,  verify  that 

r\  dr /(0,0) 


CHAPTER  6 


Vector- Valued  Functions 
of  Several  Variables 


IN  THIS  CHAPTER  we  study  the  differential  calculus  of  vector-valued  functions  of  several 
variables. 

SECTION  6.1  reviews  matrices,  determinants,  and  linear  transformations,  which  are  inte- 
gral parts  of  the  differential  calculus  as  presented  here. 

SECTION  6.2  defines  continuity  and  differentiability  of  vector-valued  functions  of  several 
variables.  The  differential  of  a  vector-valued  function  F  is  defined  as  a  certain  linear  trans- 
formation. The  matrix  of  this  linear  transformation  is  called  the  differential  matrix  of  F, 
denoted  by  F'.  The  chain  rule  is  extended  to  compositions  of  differentiable  vector- valued 
functions. 

SECTION  6.3  presents  a  complete  proof  of  the  inverse  function  theorem. 

SECTION  6.4.  uses  the  inverse  function  theorem  to  prove  the  implicit  function  theorem. 


6.1  LINEAR  TRANSFORMATIONS  AND  MATRICES 


In  this  and  subsequent  sections  it  will  often  be  convenient  to  write  vectors  vertically;  thus, 
instead  of  X  =  (xi ,  Xi, . . . ,  x„)  we  will  write 


Xl 

x2 


when  dealing  with  matrix  operations.  Although  we  assume  that  you  have  completed  a 
course  in  linear  algebra,  we  will  review  the  pertinent  matrix  operations. 

We  have  defined  vector- valued  functions  as  ordered  n -tuples  of  real-valued  functions,  in 
connection  with  composite  functions  h  =  f  o  G,  where  /  is  real-valued  and  G  is  vector- 
valued.  We  now  consider  vector-valued  functions  as  objects  of  interest  on  their  own. 


361 


362    Chapter  6  Vector-Valued  Functions  of  Several  Variables 


If  /i>  fi,  •  •  • .  fm  are  real-valued  functions  denned  on  a  set  £>  in  W,  then 

/i 

/2 


assigns  to  every  X  in  D  an  m-vector 


F(X)  = 


/i(X) 
/2(X) 

/m(X) 


Recall  that  f\ ,  fa,  . . . ,  fm  are  the  component  functions,  or  simply  components,  of  F.  We 
write 

F  :  K"  ->  Rm 

to  indicate  that  the  domain  of  F  is  in  W  and  the  range  of  F  is  in  Km .  We  also  say  that  F  is  a 
transformation  from  W  to  Km .  If  m  =  1 ,  we  identify  F  with  its  single  component  function 
fx  and  regard  it  as  a  real-valued  function. 


Example  6.1.1  The  transformation  F  :  R2 

F(x,y)  = 


defined  by 


2x  +  3y 
-x  +  Ay 
x-y 


has  component  functions 

fi(x,y)  =  2x  +  3y,    f2(x,  y)  = -x  +  4y,    f3(x,y)  =  x-y. 

Linear  Transformations 

The  simplest  interesting  transformations  from  W  to  Rm  are  the  linear  transformations, 
defined  as  follows 


Definition  6.1.1  A  transformation  L  :  W  ->  Rm  defined  on  all  of  W  is  linear  if 

L(X  +  Y)  =  L(X)  +  L(Y) 

for  all  X  and  Y  in  W  and 


L(aX)  =  aL(X) 


for  all  X  in  W  and  real  numbers  a. 
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Theorem  6.1.2  A  transformation^  :  Mn  M! 
only  if 

anxi  +  ai2x2  + 


defined  on  all  ofW  is  linear  if  and 


L(X) 


"T  d\nxn 
+  a2nxn 


a2\X\  +  a22x2  + 

amlxl  +  ®m2x2  +  * "  *  +  amnxn 


(6.1.1) 


where  the  ay 's  are  constants. 


Proof   If  can  be  seen  by  induction  (Exercise  6.1.1)  that  if  L  is  linear,  then 
L(aiXi  +  a2X2  +  ■■■  +  akXk)  =  aiL(Xx)  +  a2L(X2)  +  ■■■  +  akL{Xk) 


(6.1.2) 


for  any  vectors  Xi,  X2, 
written  as 


X  = 


X^  and  real  numbers  a\,  a2,  . . .,  ak.  Any  X  in  W  can  be 


Xi 

'  1 

'  0  " 

'  0 

x2 

0 

1 

0 

=  xl 

+  x2 

H  h  xn 

xn 

0 

0 

1 

=  XiEi  +  x2E2  H  h  xnEn. 

Applying  (6.1.2)  with  k  =  n,  X,-  =  E,-,  and  a,-  =  x,  yields 

L(X)  =  XiL(Ei)  +  x2L(E2)  +  ■  ■  ■  +  XnL(E„). 

Now  denote 


(6.1.3) 


L(Ei)  = 


ay 
a2j 


so  (6.1.3)  becomes 


an 

a\2 

am 

a21 

a22 

a2„ 

X\ 

+  x2 

H  h  xn 

am\ 

Um2 

L(X)  =  xi 


which  is  equivalent  to  (6.1.1).  This  proves  that  if  L  is  linear,  then  L  has  the  form  (6.1.1). 
We  leave  the  proof  of  the  converse  to  you  (Exercise  6.1.2).  H 

We  call  the  rectangular  array 


an  «i2 
a2\  a2\ 

am\  am2 


0-2n 


(6.1.4) 
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the  matrix  of  the  linear  transformation  (6.1.1).  The  number  a,y  in  the  ith  row  and  y'th 
column  of  A  is  called  the  (i,  j  )th  entry  of  A.  We  say  that  A  is  an  m  x  n  matrix,  since  A 
has  m  rows  and  n  columns.  We  will  sometimes  abbreviate  (6.1.4)  as 

A  =  [fly]. 

Example  6.1.2  The  transformation  F  of  Example  6.1.1  is  linear.  The  matrix  of  F  is 


We  will  now  recall  the  matrix  operations  that  we  need  to  study  the  differential  calculus 
of  transformations. 

Definition  6.1.3 

(a)  If  c  is  a  real  number  and  A  =  [a//]  is  an  m  x  n  matrix,  then  cA  is  the  m  x  n  matrix 
defined  by 

cA  =  [cau]; 

that  is,  cA  is  obtained  by  multiplying  every  entry  of  A  by  c. 

(b)  If  A  =  [dij]  and  B  =  [bij]  are  m  x  n  matrices,  then  the  sum  A  +  B  is  the  m  x  n 
matrix 

A  +  B  =  [aij+bij]; 

that  is,  the  sum  of  two  m  x  n  matrices  is  obtained  by  adding  corresponding  entries. 
The  sum  of  two  matrices  is  not  defined  unless  they  have  the  same  number  of  rows  and 
the  same  number  of  columns. 

(c)  If  A  =  [an]  is  an  m  x  p  matrix  and  B  =  [bij]  is  a  p  x  n  matrix,  then  the  product 
C  =  AB  is  the  m  x  n  matrix  with 


anby  +  ai2b2j  H  h  aipbpj  =  ^  aikbkj,     1  <  i  <  m,  1  <  j  <  n. 

k=l 


Thus,  the  (i,  j  )th  entry  of  AB  is  obtained  by  multiplying  each  entry  in  the  i'th  row  of 
A  by  the  corresponding  entry  in  the  y'th  column  of  B  and  adding  the  products.  This 
definition  requires  that  A  have  the  same  number  of  columns  as  B  has  rows.  Otherwise, 
AB  is  undefined. 


Example  6.1.3  Let 

A  = 

and 


2    1  2 
-10  3 
0  1 


0 


B 


0    1  1 
-10  2 
3    0  1 


1  2 
-3  1 


10-11 
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Then 


and 


2A  = 


A  +  B 


2(2)  2(1)  2(2) 

2(-l)  2(0)  2(3) 

2(0)  2(1)  2(0) 

2  +  0  1  +  1  2+1 

-1-1  0  +  0    3  +  2 

0+3  1+0  0+1 


4    2  4 
-2    0  6 
0    2  0 

2  2  3 
-2    0  5 

3  1  1 


The  (2,  3)  entry  in  the  product  AC  is  obtained  by  multiplying  the  entries  of  the  second 
row  of  A  by  those  of  the  third  column  of  C  and  adding  the  products:  thus,  the  (2,  3)  entry 
of  AC  is 

(-1)(1)  +  (0)(-3)  +  (3)(-l)  =  -4. 

The  full  product  AC  is 


2    1  2 
-10  3 
0    1  0 


5  0 
3  0 
1  0 


1  2 
-3  1 
-1  1 


15  0 
-2  0 
3  0 


-3  7 
-4  1 
-3  1 


Notice  that  A  +  C,  B  +  C,  CA,  and  CB  are  undefined.  ■ 

We  leave  the  proofs  of  next  three  theorems  to  you  (Exercises  6.1.7-6.1.9) 

Theorem  6.1.4  If  A,  B,  and  C  are  m  x  n  matrices,  then 

(A  +  B)  +  C  =  A  +  (B  +  C). 

Theorem  6.1.5  If A  andR  are  mxn  matrices  and r  and s  are  real  numbers,  then  (a) 
r(sA)  =  (rs)A;  (b)  (r  +  s)A  =  rA  +  sA;  (c)  r(A  +  B)  =  rA  +  rB. 

Theorem  6.1.6  If  A,  B,  and  C  are  m  x  p,  p  x  q,  and  q  x  n  matrices,  respectively, 
then  (AB)C  =  A(BC). 

The  next  theorem  shows  why  Definition  6.1.3  is  appropriate.  We  leave  the  proof  to  you 
(Exercise  6.1.11). 

Theorem  6.1.7 


(a)   If  we  regard  the  vector 


X 


A'2 


as  ann  x  1  matrix,  then  the  linear  transformation  (6.1.1)  can  be  written  as 

L(X)  =  AX. 
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(b)  TfLi  and  L2  are  linear  transformations  from  W  to  M.m  with  matrices  A\  and  A2 
respectively,  then  C1L1  +  C2L2  is  the  linear  transformation  from  M"  to  M.m  with 
matrix  C\K\  +  c2A2. 

(c)  //"Li  :  R"  — >  Rp  and  L2  :  Rp  — >  Mm  are  linear  transformations  with  matrices  Ai 
ant/  A2,  respectively,  then  the  composite  function  L3  =  L2  o  Li ,  defined  by 

L3(X)  =L2(L1(X)), 

/s  f/ze  Z/near  transformation  from  W  to  Rm  with  matrix  A2Ai . 

Example  6.1.4  If 


Li(X) 


then 


A,  = 


2x  +  3y 
3x  +  2y 
-x  +  y 

2  3 

3  2 
-1  1 


and  L2(X) 


and    A2  = 


—x  —  y 
4x  +  y 

x 


-1  -1 
4  1 
1  0 


The  linear  transformation 
is  defined  by 


L  =  2Li  +  L2 


L(X)  =  2L!(X)  +  L2(X) 


2x  +  3y 

-x-y 

2 

3x  +  2y 

+ 

4x  +  y 

-x  +  y 

X 

The  matrix  of  L  is 


Example  6.1.5  Let 


and 


A  = 


3x  +  5y 
lOx  +  5y 
-x  +  2y 

3    5  " 
10  5 
-1  2 


=  2Ai  +  A2. 


Li(X)  = 


L2(U)  = 


x  +  2y 
3x  +  Ay 

u  +  v 
—u  —  2v 
3u  +  v 


Section  6. 1  Linear  Transformations  and  Matrices  367 


Then  L3  =  L2  o  Li  :  R2      R3  is  given  by 

L3(X)  =La((Li(X))  = 
The  matrices  of  Li  and  L2  are 
Ai 

respectively.  The  matrix  of  L3  is 

C  = 


(x  +  2y)  +  (3jc  +  4j) 
-(x  +  2j)  -  2(3*  +  4j) 
3(x  +  2j)  +  (3jc  +  4j) 


4jc  +  6j 
— 7jc  -  lOy 
6x  +  lOy 


1  2 
3  4 


and 


1  1 
-1  -2 
3  1 


4  6 
7  -10 
6  10 


=  A2A1. 


Example  6.1.6  The  linear  transformations  of  Example  6.1.5  can  be  written  as 


Li(X)  = 


1  2 
3  4 


x 

y 


and 


L3(X)  = 


,    L2(U)  = 


4  6 
-7  -10 
6  10 


1  1 
-1  -2 
3  1 


A' 

y 


A  New  Notation  for  the  Differential 

If  a  real-valued  function  /  :  W  — >■  R  is  differentiable  at  Xo,  then 

^x0/  =  /jci  (Xo)  dxi  +  /x2(X0)  fi?x2  H  h  /*„  (X0)  rfx„. 

This  can  be  written  as  a  matrix  product 


<W  =  [/*i(Xo)    A2(X0)    ■■■  A„(X0)] 


<ixi 
<ix2 


We  define  the  differential  matrix  of  f  at  Xo  by 

/'(X0)  =  [/X1(X0)    A2(X0)    •■•  A„(X0)] 
and  the  differential  linear  transformation  by 


dX  = 


dx\ 
dx2 


(6.1.5) 


(6.1.6) 
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Then  (6.1.5)  can  be  rewritten  as 


dXof  =  f'(X0)dX. 


(6.1.7) 


This  is  analogous  to  the  corresponding  formula  for  functions  of  one  variable  (Exam- 
ple 5.3.7),  and  shows  that  the  differential  matrix  /'(Xo)  is  a  natural  generalization  of  the 
derivative.  With  this  new  notation  we  can  express  the  defining  property  of  the  differential 
in  a  way  similar  to  the  form  that  applies  for  n  =  1 : 

,.  /(X)-/(X0)-/'(Xo)(X-Xq) 

hm   ■  ■  =  0, 

x^x0  |X  —  X0| 

where  Xo  =  (xio,  X20,  ■  ■  ■ ,  x„o)  and  /'(Xo)(X  —  Xo)  is  the  matrix  product 
[fxl(X0)    fX2(X0)     •••  fXn(X0)] 


Xl  —  XlO 
X%  —  X20 


Xyi  XnQ 

As  before,  we  omit  the  Xo  in  (6.1.6)  and  (6.1.7)  when  it  is  not  necessary  to  emphasize 
the  specific  point;  thus,  we  write 


/'  =  [/*,  f> 
Example  6.1.7  If 


••■     fXn]     and     df  =  f'dX. 

f(x,y,z.)  =  4x2yz3, 


then 


f'(x,y,z)  =  [8xyz3    4xV  \2x2yz2\. 
In  particular,  if  Xo  =  (1,-1,2),  then 

/'(X0)  =  [-64    32  -48], 


dXof  =  f'(X0)dX=  [-64    32  -48] 


dx 
dy 
dz 

-64  dx  +  32  dy-  48  dz. 


The  Norm  of  a  Matrix 

We  will  need  the  following  definition  in  the  next  section. 

Definition  6.1.8  The  norm,  ||  A|| ,  of  an  m  x  n  matrix  A  =  [ay]  is  the  smallest  number 
such  that 

|AX|  <  ||A||  |X| 

for  all  X  in  W.  ■ 
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To  justify  this  definition,  we  must  show  that  ||A||  exists.  The  components  of  Y  =  AX 
are 

yt  =  cii\X\  +  ai2x2  H  1-  ainxn,     1  <  i  <  m. 

By  Schwarz's  inequality, 

yf  <  (fl?i  +af2  +  ---  +  afn)\X\2. 
Summing  this  over  1  <  i  <  m  yields 

(m       n  \ 

Therefore,  the  set 

5  =  {i<:  |  |AX|  <  ^|X|    for  all  X  in  M"} 

is  nonempty.  Since  B  is  bounded  below  by  zero,  B  has  an  infimum  a.  If  €  >  0,  then  a  +  € 
is  in  5  because  if  not,  then  no  number  less  than  a  +  e  could  be  in  B.  Then  a  +  e  would  be 
a  lower  bound  for  B,  contradicting  the  definition  of  a.  Hence, 

|AX|  <  (a  +  e)|X|,  XeM". 

Since  e  is  an  arbitrary  positive  number,  this  implies  that 

|AX|<a|X|,  XeM", 

so  a  €  B.  Since  no  smaller  number  is  in  B,  we  conclude  that  ||A||  =  a. 

In  our  applications  we  will  not  have  to  actually  compute  the  norm  of  a  matrix  A;  rather, 
it  will  be  sufficient  to  know  that  the  norm  exists  (finite). 


Square  Matrices 

Linear  transformations  from  K"  to  MP  will  be  important  when  we  discuss  the  inverse  func- 
tion theorem  in  Section  6.3  and  change  of  variables  in  multiple  integrals  in  Section  7.3. 
The  matrix  of  such  a  transformation  is  square;  that  is,  it  has  the  same  number  of  rows  and 
columns. 

We  assume  that  you  know  the  definition  of  the  determinant 


det(A)  = 


of  an  n  x  n  matrix 


«n  a\2 

Onl  a„2 


ann 


din 
a2n 


ann 
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The  transpose,  A' ,  of  a  matrix  A  (square  or  not)  is  the  matrix  obtained  by  interchanging 
the  rows  and  columns  of  A;  thus,  if 


1  2 
3  1 
0  1 


then  A' 


1  3 

2  1 

3  4 


A  square  matrix  and  its  transpose  have  the  same  determinant;  thus, 

det(Ar)  =  det(A). 

We  take  the  next  theorem  from  linear  algebra  as  given. 

Theorem  6.1.9  If  A  andB  are  n  x  n  matrices,  then 

det(AB)  =  det(A)det(B). 

The  entries  an,  1  <  i  <  n,  of  an  n  x  n  matrix  A  are  on  the  main  diagonal  of  A.  The  nxn 
matrix  with  ones  on  the  main  diagonal  and  zeros  elsewhere  is  called  the  identity  matrix  and 
is  denoted  by  I;  thus,  if  n  =  3, 


I  = 


1  0  0 
0  1  0 
0    0  1 


We  call  I  the  identity  matrix  because  AI  =  A  and  IA  =  A  if  A  is  any  nxn  matrix.  We 
say  that  an  n  x  n  matrix  A  is  nonsingular  if  there  is  an  n  x  n  matrix  A-1 ,  the  inverse  of  A, 
such  that  AA_1  =  A-1  A  =  I.  Otherwise,  we  say  that  A  is  singular 

Our  main  objective  is  to  show  that  an  n  x  n  matrix  A  is  nonsingular  if  and  only  if 
det(A)  7^  0.  We  will  also  find  a  formula  for  the  inverse. 

Definition  6.1.10  Let  A  =  [ay]  be  an  n  x  n  matrix,  with  n  >  2.  The  cofactor  of  an 
entry  ay  is 

Cij  =  det(Ay), 

where  Ay  is  the  (n  —  1)  x  (n  —  1)  matrix  obtained  by  deleting  the  zth  row  and  y'th  column 
of  A.  The  adjoint  of  A,  denoted  by  adj(A),  is  the  n  x  n  matrix  whose  (z,  j')th  entry  is  c/;. 


Example  6.1.8  The  cofactors  of 


2  1 
-1  2 
1  2 
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are 


Cll 


C21 


C31  = 


-1  2 

1  2 

2  1 

1  2 

2  1 
-1  2 


Cl2 


C22 


SO 


=    5,    c32  =  - 


adj(A) 


3  2 
0  2 

4  1 

0  2 

4  1 

3  2 


Cl3 


8,  c23 


=  -5,    c33  = 


-4    -3  5 
-6      8  -5 
3    -4  -10 


3  -1 
0  1 

4  2 
0  1 


=  -io, 


Notice  that  adj(A)  is  the  transpose  of  the  matrix 

-4-6  3 
-3      8  -4 
5    -5  -10 


obtained  by  replacing  each  entry  of  A  by  its  cofactor. 

For  a  proof  of  the  following  theorem,  see  any  elementary  linear  algebra  text. 


Theorem  6.1.11  Let  A  be  an  n  x  n  matrix. 

(a)  The  sum  of  the  products  of  the  entries  of  a  row  of  A.  and  their  cof actors  equals  det(A) , 
while  the  sum  of  the  products  of  the  entries  of  a  row  of  A  and  the  cofactors  of  the 
entries  of  a  different  row  equals  zero;  that  is, 

E"  (det(A),    i  =  j, 

aikCjk=>0^  (6.1.8) 

(b)  The  sum  of  the  products  of  the  entries  of  a  column  of  A.  and  their  cofactors  equals 
det(A),  while  the  sum  of  the  products  of  the  entries  of  a  column  of  A  and  the  cofactors 
of  the  entries  of  a  different  column  equals  zero;  that  is, 

£^,  =  {oet(A)'  (6-L9) 
k=i       1 '      r  j~ 

If  we  compute  det(A)  from  the  formula 

n 

det(A)  =  ^aikcik, 
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we  say  that  we  are  expanding  the  determinant  in  cof actors  of  its  i  th  row.  Since  we  can 
choose  i  arbitrarily  from  {1, . . . ,  «},  there  are  n  ways  to  do  this.  If  we  compute  det(A) 
from  the  formula 

n 

det(A)  =  ^akjCkj, 

k=l 

we  say  that  we  are  expanding  the  determinant  in  cofactors  of  its  j  th  column.  There  are  also 
n  ways  to  do  this. 

In  particular,  we  note  that  det(I)  =  1  for  all  n  >  1 . 

Theorem  6.1.12  Let  A  be  an  n  x  n  matrix.  //det(A)  =  0,  then  A  is  singular.  If 
det(A)  7^  0,  then  A  is  nonsingular,  and  A  has  the  unique  inverse 

A_1  =  adJ^-  (6-L10) 

det(A) 

Proof  If  det(A)  =  0,  then  det(AB)  =  0  for  any  n  x  n  matrix,  by  Theorem  6.1.9. 
Therefore,  since  det(I)  =  1,  there  is  no  matrix  n  x  n  matrix  B  such  that  AB  =  I;  that  is,  A 
is  singular  if  det(A)  =  0.  Now  suppose  that  det(A)  ^  0.  Since  (6. 1 .8)  implies  that 

Aadj(A)  =  det(A)I 

and  (6.1.9)  implies  that 

adj(A)A  =  det(A)I, 

dividing  both  sides  of  these  two  equations  by  det(A)  shows  that  if  A"1  is  as  defined  in 
(6.1.10),  then  AA"1  =  A_1A  =  I.  Therefore,  A"1  is  an  inverse  of  A.  To  see  that  it  is  the 
only  inverse,  suppose  that  B  is  an  n  x  n  matrix  such  that  AB  =  I.  Then  A-1  (AB)  =  A-1 , 
so  (A"1  A)B  =  A"1 .  Since  AA"1  =  I  and  IB  =  B,  it  follows  that  B  =  A-1.  U 


Example  6.1.9  In  Example  6.1.8  we  found  that  the  adjoint  of 


is 


adj(A)  = 


4  2  1 
3-12 
0      1  2 


-4    -3  5 
-6      8  -5 
3    -4  -10 


We  can  compute  det(A)  by  finding  any  diagonal  entry  of  Aadj(A).  (Why?)  This  yields 
det(A)  =  -25.  (Verify.)  Therefore, 


A     =  — 


1 

25 


-3  5 
8  -5 
-4  -10 
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Now  consider  the  equation 

AX  =  Y  (6.1.11) 

with 


a\2  ■ 

Cl\n 

Xi 

yi 

A  = 

021 

a22  ■ 

■  a2„ 

,    x  = 

x2 

,     and     Y  = 

yi 

<2«2  - 

Xn  _ 

_  y» 

Here  A  and  Y  are  given,  and  the  problem  is  to  find  X. 

Theorem  6.1.13  The  system  (6.1.1 1)  has  a  solution  Xfor  any  given  Y  if  and  only  if 
A  is  nonsingular.  In  this  case,  the  solution  is  unique  and  is  given  by\  =  A-1  Y. 

Proof    Suppose  that  A  is  nonsingular,  and  let  X  =  A-1  Y.  Then 

AX  =  A(A_1Y)  =  (AA~:)Y  =  IY  =  Y; 

that  is,  X  is  a  solution  of  (6.1.11).  To  see  that  X  is  the  only  solution  of  (6.1.11),  suppose 
that  AXi  =  Y.  Then  AXX  =  AX,  so 

A"1  (AX)  =  A_1(AXi) 

and 

(A_1A)X  =  (A~:A)Xi, 

which  is  equivalent  to  IX  =  IX i ,  or  X  =  Xi. 

Conversely,  suppose  that  (6.1.11)  has  a  solution  for  every  Y,  and  let  X,  satisfy  AX,  = 
E,-,  1  <  i  <  n.  Let 

B  =  [Xi  X2  ■  •  ■  X„]; 
that  is,  Xi,  X2,  •  •  • ,  X„  are  the  columns  of  B.  Then 

AB  =  [AXi  AX2  •  ■  •  AX„]  =  [Ei  E2  •  ■  •  E„]  =  I. 

To  show  that  B  =  A-1,  we  must  still  show  that  BA  =  I.  We  first  note  that,  since  AB  =  I 
and  det(BA)  =  det(AB)  =  1  (Theorem  6.1.9),  BA  is  nonsingular  (Theorem  6.1.12).  Now 
note  that 

(BA)(BA)  =  B(AB)A)  =  BIA; 

that  is, 

(BA)(BA)  =  (BA). 

Multiplying  both  sides  of  this  equation  on  the  left  by  BA)-1  yields  BA  =  1.  TJ 

The  following  theorem  gives  a  useful  formula  for  the  components  of  the  solution  of 
(6.1.11). 
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Theorem  6.1.14  (Cramer's  Rule)  If  A  =  [atj]  is  nonsingular,  then  the  solu- 
tion of  the  system 

anxi  +  ai2x2  H  h  a\nx„  =  y\ 

a2\xY  +  a22x2  H  h  a2nxn  =  y2 


an\Xi  +  an2x2  H  h  a„„*„  =  y„ 

{or,  in  matrix  form,  AX  =  Y)  is  given  by 


Xj  = 


det(A) ' 


1  <  i  <  n. 


where  Di  is  the  determinant  of  the  matrix  obtained  by  replacing  the  ith  column  of  A.  with 
Y;  thus, 


Di  = 


yi  «i2 
yi  a22 

yn  an2 


a2n 


D?  = 


an    yi  -an 

tf/il      yn  (InZ 


a2n 


an    ■■■    a\yn-\  yi 

«21      ■■■      «2,n-l  J2 


@nl      '  '  '      &n,n—l  yn 

Proof   From  Theorems  6.1.12  and  6.1.13,  the  solution  of  AX  =  Y  is 


Xl 

x2 


=  A1Y  = 


1 


det(A) 


Cll  C21 
Cl2  c22 

C\n  C2n 


Cnl 
Cn2 


yi 

3'2 

y» 


cnyi  +  c21y2  H  h  cnij„ 

Cl2jl  +  C22j2  H  h  c„2y« 

Clnjl  +  C2„J2  H  1-  c„„y„ 


But 


cnyi  +  C2U2  H  h  c„ijn  = 


)>i  ai2  •••  am 
>>2    a22    •••  02« 
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as  can  be  seen  by  expanding  the  determinant  on  the  right  in  cofactors  of  its  first  column. 
Similarly, 


c\iy\  +  c22yi  H  1-  cn2y„ 


an 
an 


yi  an 
yi  «23 


&2n 


®n\      yn  Qn3 


as  can  be  seen  by  expanding  the  determinant  on  the  right  in  cofactors  of  its  second  column. 
Continuing  in  this  way  completes  the  proof.  H 

Example  6.1.10  The  matrix  of  the  system 

\x  +  2y  +  z.  =  1 
3x  —  y  +  2z,  =  2 
y+2z  =  0 


A  = 


2  1 
-1  2 
1  2 


Expanding  det(A)  in  cofactors  of  its  first  row  yields 
det(A)  =  4 


-1  2 

3 

2 

+  1 

3  -1 

1  2 

-2 

0  1 

0 

2 

=  4(_4)_2(6)+  1(3)  =  -25. 
Using  Cramer's  rule  to  solve  the  system  yields 


1 

25 


1  2  1 
2-12 
0      1  2 


Z  =  ~25 


1 


5' 

y  = 

25 

4 

2 

1 

3 

-1 

2 

0 

1 

0 

A  system  of  n  equations  in  n  unknowns 


anxi  +  ai2X2  H  1-  a\nxn  =  0 

a2iXi  +  022X2  H  1-  a2nxn  =  0 

an\X\  +  an2x2  H  1-  annxn  =  0 


4 

1 

1 

3 

2 

2 

0 

0 

2 

(6.1.12) 


(or,  in  matrix  form,  AX  =  0)  is  homogeneous.  It  is  obvious  that  Xo  =  0  satisfies  this 
system.  We  call  this  the  trivial  solution  of  (6.1.12).  Any  other  solutions  of  (6.1.12),  if  they 
exist,  are  nontrivial. 
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We  will  need  the  following  theorems.  The  proofs  may  be  found  in  any  linear  algebra 
text. 

Theorem  6.1.15  The  homogeneous  system  (6.1.12)  ofn  equationsinn  unknowns  has 
a  nontrivial  solution  if  and  only  ;/det(A)  =  0. 

Theorem  6.1.16  If  A\,  A%, . . . ,  Ak  are  nonsingularnxn  matrices,  thenso  is  A1A2  ■  ■  ■  Ak, 
and 

(A1A2---Aky1  =  A^A^—Ai1. 
6.1  Exercises 


2. 
3. 


5. 


Prove:  If  L  :  W  — >•  M.m  is  a  linear  transformation,  then 

L(axXi  +  a2X2  +  ■■■  +  akXk)  =  fliL(Xi)  +  a2L(X2)  +  ■  •  ■  +  akL(Xk) 

if  Xi,  X2, . . . ,  Xfc  are  in  K"  and  a\,  a2,  ■  ■  ■ ,  ak  are  real  numbers. 
Prove  that  the  transformation  L  defined  by  Eqn.  (6.1.1)  is  linear. 
Find  the  matrix  of  L. 


(a)  L(X)  = 

Find  cA. 
(a)  c  =  4, 
Find  A  +  B 


3x  +  4y  +  6z. 
2x  -  47  +  2z 
Ix  +  2y  +  3z 


(b)  L(X)  = 


2xi  +  Ax2 
3>x\  —  2x2 
lx\  —  4x2 
6x1  +  x2 


A  = 


6 
3 
11 


(b)  c  =  -2,  A 


1  3 

0  1 

1  -1 


"  -1 

2 

3  " 

1 

0 

3 

(a)A  = 

1 

1 

4 

,    B  = 

5 

6 

-7 

0  - 

1 

4 

0 

-1 

2 

"  0    5  " 

"  -1 

2 

(b)A  = 

3  2 

B 

0 

3 

1  7 

4 

7 

6.    Find  AB. 


"  -1      2  3" 

"  -1 

2  " 

(a)A  = 

0      1  4 

,    B  = 

0 

3 

0-14 

4 
1  " 

7 

(b)A  = 

"  5    3    2    1  " 
6    7    4  1 

,    B  = 

3 
4 
7 
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7. 

8. 

9. 
10. 
11. 
12. 


13. 
14. 
15. 


16. 


17. 
18. 
19. 
20. 


Prove  Theorem  6.1.4. 
Prove  Theorem  6.1.5. 
Prove  Theorem  6. 1 .6. 

Suppose  that  A  +  B  and  AB  are  both  defined.  What  can  be  said  about  A  and  B? 

Prove  Theorem  6. 1 .7. 

Find  the  matrix  of  aLi  +  bh2- 

3x  +  2y  +  z. 


(a)   U(x,y,z.)  = 


L2(x,y,z)  = 


(b)  U(x,y) 


x  +  4y  +  2z. 
3x-4y  +  z. 

-x  +  y  -  z 
-2x  +  y  +  3z 
y  +  z 


a  =2,    b  =  -\ 


2x  +  3y 
x-  y 
4x  +  y 


L2(x,  y)  = 


3x  —  y 
x  +  y 
-x-y 


a  =  4, 


Find  the  matrices  of  L10L2  andL^oLi,  where  Li  andL2  are  as  in  Exercise  6.1.12(a). 
Write  the  transformations  of  Exercise  6.1.12  in  the  form  L(X)  =  AX. 
Find  /'  and  f'(X0). 

(a)  f(x,y,z)  =  3x2yz„    X0  =  (1,-1,1) 

(b)  f(x,y)  =  sm(x  +  y),    X„  =  (tt/4,  jt/4) 

(c)  f(x,y,z)  =  xye-xz,    X0  =  (1,2,0) 

(d)  f(x,  y,z)  =  tan(jt  +  2y  +  z),    X0  =  (tt/4,  -tt/8,  tt/4) 

(e)  /(X)  =  |X|  :  W  -¥  R,   X0  =  (l/«Jn,  l/</n  l/«Jn) 

Let  A  =  [ciij]  be  an  m  x  n  matrix  and 


A 


max  \ \ai 


1  <  i  <  m,  1  <  i  <  n}  . 


Show  that  || A ||  <  X^/mn. 

Prove:  If  A  has  at  least  one  nonzero  entry,  then  ||  A||  ^  0 
Prove:  ||A  +  B||  <  ||A||  +  ||B||. 
Prove:  ||AB||  <  ||A||  ||B||. 
Solve  by  Cramer's  rule. 

x  +   y  +  2z  =    1  x  +  y 


(a)   2x  -  y  +  z  =  -l 
x  —  2y  —  3z,  =  2 


z.  =  5 
(b)    3x-2y  +  2z=  0 
4x  +  2y  -  3z,  =  14 
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22. 


x  +  2y  +  3z  =  -5 
(c)   x         -  z  =  -1 
x  +   y  +  2z  =  -4 


(d) 


x  —  y  +  z  —  2w=  1 
2x+   y  -  3z  +  3w  =  4 


3x  +  2y 
2x  +  y 


+   w=  13 
=  4 


21.     Find  A  1  by  the  method  of  Theorem  6.1.12. 


(a) 


(c) 


(e) 


1 

3 

4 

3 
0 


-2 
4 


2  1 
-1  2 
1  2 


12  0  0 

-2    3  0  0 

0  0  2  3 
0  0-12 


(b) 


(d) 


(f) 


1 

2 

3 

1 

0 

-1 

1 

1 

2 

1 

0 

1  " 

0 

1 

1 

1 

1 

0 

1 

1 

2 

-1 

2 

2 

-1 

3 

-1 

4 

1 

2 

3 

1 

0 

1 

For  1  <  i,j  <  m,  let  an  =  aij(K)  be  a  real-valued  function  continuous  on  a 
compact  set  K  in  W .  Suppose  that  the  m  x  m  matrix 

A(X)  =  [%-(X)] 

is  nonsingular  for  each  X  in  K,  and  define  the  m  x  m  matrix 

B(X,Y)=  [bij(X,Y)] 

by 

B(X,Y)  =  A~1(X)A(Y)  —  I. 
Show  that  for  each  e  >  0  there  is  a  S  >  0  such  that 

\bij(X,Y)\  <e,  \<i,j<m, 

if  X,  Y  e  K  and  |X  —  Y|  <  S.  Hint:  Show  thatbij  is  continuous  on  the  set 

{(X,  Y)  |  X  €  K,  Y  €  K}. 

Then  assume  that  the  conclusion  is  false  and  use  Exercise  5.1.32  to  obtain  a  contradiction. 


6.2  CONTINUITY  AND  DIFFERENTIABILITY  OF  TRANS- 
FORMATIONS 


Throughout  the  rest  of  this  chapter,  transformations  F  and  points  X  should  be  considered  as 
written  in  vertical  form  when  they  occur  in  connection  with  matrix  operations.  However, 
we  will  write  X  =  (xy,  Xz, . . . ,  Xn)  when  X  is  the  argument  of  a  function. 
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Continuous  Transformations 

In  Section  5.2  we  defined  a  vector- valued  function  (transformation)  to  be  continuous  at  Xo 
if  each  of  its  component  functions  is  continuous  at  Xo.  We  leave  it  to  you  to  show  that  this 
implies  the  following  theorem  (Exercise  1). 

Theorem  6.2.1  Suppose  thatXo  is  in,  and  a  limit  point  of  the  domain  o/F  :  M"  — > 
M.m .  Then  F  is  continuous  at  Xq  if  and  only  if  for  each  e  >  0  there  is  a  8  >  0  such  that 


|F(X)  -F(X0)|  <  e    if    |X-X0|<<5    and    X  e  DF. 


(6.2.1) 


This  theorem  is  the  same  as  Theorem  5.2.7  except  that  the  "absolute  value"  in  (6.2.1) 
now  stands  for  distance  in  Rm  rather  than  ML 

If  C  is  a  constant  vector,  then  "limx->x0         =  C"  means  that 

lim  |F(X)  -C|  =  0. 

X->Xo 

Theorem  6.2. 1  implies  that  F  is  continuous  at  Xo  if  and  only  if 

lim  F(X)  =  F(X0). 

x^x0 


Example  6.2.1  The  linear  transformation 

L(X)  = 

is  continuous  at  every  Xo  in  M3,  since 
L(X)-L(X0)  =L(X-Xo)  = 


x  +  y  +  z 
2x  -  3y  +  z, 
2x  +  y  -  z 


(x  -  x0)  +  (y  -  y0)  +  (z,  -  z0) 
2(x  -  x0)  -  3(y  -  y0)  +  (z,  -  z0) 
2(x  -  x0)  +  (y  -  yo)  -  (z  -  zo) 

and  applying  Schwarz's  inequality  to  each  component  yields 

|L(X)-L(X0)|2  <  (3  +  14+  6)|X-X0|2  =  23|X-X0|2. 

Therefore, 

|L(X)-L(Xo)|  <e    if    |X-Xo|  < 


Differentiable  Transformations 

In  Section  5.4  we  defined  a  vector- valued  function  (transformation)  to  be  differentiable  at 
Xo  if  each  of  its  components  is  differentiable  at  Xo  (Definition  5.4.1).  The  next  theorem 
characterizes  this  property  in  a  useful  way. 
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Theorem  6.2.2  A  transformation^  =  (f\,  fj, . . . ,  fm)  defined  in  a  neighborhood  of 
Xq  e  W  is  differentiable  at  Xq  if  and  only  if  there  is  a  constant  m  x  n  matrix  A  such  that 


lim 

x^x0 


F(X)-F(X0)-A(X-Xo) 


|X-X„| 

If '(6.2.2)  holds,  then  A  is  given  uniquely  by 


A  = 


.  9*/  - 


0. 


9/i(X0) 

3/i(X0) 

9/i(X0) 

3X2 

3jc„ 

3/2  (X0) 

9/2(X0) 

9/2(X0) 

3*2 

3*« 

3/m(X0) 

9/m(X0) 

9/m(Xo) 

3*2 

3*« 

(6.2.2) 


(6.2.3) 


Proof   Let  Xo  =  (xio,  *20,  ■  ■  ■ ,  x„o).  If  F  is  differentiable  at  Xo,  then  so  are  /i,  f?., 
fm  (Definition  5.4.1).  Hence, 


^9/i(X0) 


lim 

x^x0 


.7  =  1 


dxj 


(Xj  -Xj0) 


IX -Xn 


0,     1  <  z  <  m, 


which  implies  (6.2.2)  with  A  as  in  (6.2.3). 

Now  suppose  that  (6.2.2)  holds  with  A  =  [fly].  Since  each  component  of  the  vector  in 
(6.2.2)  approaches  zero  as  X  approaches  Xq,  it  follows  that 


lim 

x^x0 


/<09-/i(Xo)- J>/(jCy-JCyo) 
./  =  ! 


IX -X 


0,     1  <  z  <  m, 


o 


so  each  fi  is  differentiable  at  Xo,  and  therefore  so  is  F  (Definition  5.4.1).  By  Theo- 
rem 5.3.6, 

9/}(X0)      .     .  .  . 

1  <  z  <  zw,     1  <  j  <  n, 


an  = 


dxj 


which  implies  (6.2.3). 
A  transformation  T  :  W 


H 


of  the  form 


T(X)  =  U  +  A(X  -  X0), 

where  U  is  a  constant  vector  in  Rm ,  Xo  is  a  constant  vector  in  W ,  and  A  is  a  constant  m  x  n 
matrix,  is  said  to  be  affine.  Theorem  6.2.2  says  that  if  F  is  differentiable  at  Xo,  then  F  can 
be  well  approximated  by  an  affine  transformation. 
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Example  6.2.2  The  components  of  the  transformation 

F(X)  = 


x2  +  2xy  +  z 
x  +  2xz,  +  y 


x2  +  y2  +  z2 

are  differentiable  at  Xo  =  (1,  0,  2).  Evaluating  the  partial  derivatives  of  the  components 
there  yields 

"22  1 
A  =      5  12 
_  2    0  4 

(Verify).  Therefore,  Theorem  6.2.2  implies  that  the  affine  transformation 
T(X)  =  F(X0)  +  A(X  -  X0) 

~  ~  x  -  1 


"  3  " 

5 

+ 

5 

satisfies 


2  2  1 
5  1  2 
2    0  4 


F(X)-T(X) 
lim  — — — —  =  0. 


y 

Z-2 


x^xo  |X-X0| 
Differential  of  a  Transformation 

If  F  =  (f\ ,  fz,  fm)  is  differentiable  at  Xo,  we  define  the  differential  ofF  at  Xo  to  be 
the  linear  transformation 

dxQfi 


dx0fm 


(6.2.4) 


We  call  the  matrix  A  in  (6.2.3)  the  differential  matrix  o/F  at  Xo  and  denote  it  by  F'(Xo); 
thus, 

9/i(X0)     3/i(X0)  3/i(X0) 


F'(Xq)  = 


dx\  3x2 
3/2(X0)  3/2(X0) 


dx\ 


dx? 


3/m(Xo)  3/m(X0) 


dxn 
3/2(X0) 
3x„ 


3/m(X0) 


dx\ 


dx? 


dx„ 


(6.2.5) 
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(It  is  important  to  bear  in  mind  that  while  F  is  a  function  from  W  to  M.m,  F'  is  not  such 
a  function;  F'  is  an  m  x  n  matrix.)  From  Theorem  6.2.2,  the  differential  can  be  written  in 
terms  of  the  differential  matrix  as 


dXoF  =  F'(Xq) 


dX2 

dXyi 


(6.2.6) 


or,  more  succinctly,  as 
where 


dXoF  =  F'(Xq)  dX, 


dX  = 


dx\ 

dX2 


as  defined  earlier. 

When  it  is  not  necessary  to  emphasize  the  particular  point  Xq,  we  write  (6.2.4)  as 


dF  = 


dfi 
df% 

dfm 


(6.2.5)  as 


F'  = 


dA 

Ml  ' 

dxi 

dx2 

dx„ 

dh 

dh 

dx\ 

dx2 

dxn 

Mm 

Mm 

dxi 

3X2 

dxn  _ 

dF 

=  F'  dX. 

With  the  differential  notation  we  can  rewrite  (6.2.2)  as 

F(X)  -  F(X0)  -  F'(X0)(X  -  X0) 

lim   ;  ;  

x->x0  |X-X0| 
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Example  6.2.3  The  linear  transformation 


F(X) 


anxi  +  a\2X2  H  1-  a\nxn 

a2\X\  +  a2iX2  H  \- a2nxn 


&m\X\  +  Clm2X2  +  '  '  '  +  amnxn 

can  be  written  as  F(X)  =  AX,  where  A  =  [ay].  Then 

F'  =  A; 


that  is,  the  differential  matrix  of  a  linear  transformation  is  independent  of  X  and  is  the 
matrix  of  the  transformation.  For  example,  the  differential  matrix  of 


is 


F(xi,x2,x3) 


F'  = 


1  2  3 

2  1  0 


1  2  3 

2  1  0 


Xl 

x2 

X3 


If  F(X)  =  X  (the  identity  transformation),  then  F'  =  I  (the  identity  matrix). 
Example  6.2.4  The  transformation 


F(x,jO  = 


x2  +  y2 

y 

x2  +  y2 
2xy 


is  differentiable  at  every  point  of  R2  except  (0,  0),  and 


y2-x2 
(x2  +  y2)2 
2xy 


2xy 


(x2  +  y2)2 

2  2 

x  —  y 


(x2  +  y2)2      (x2  +  y2)2 


2y 


2x 


In  particular, 


0  -i 


F'(l,l) 
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so 


lim 


(*oO-KU)  y/(x  -  l)2  +  (y  -  l)2 


V 


-  1  - 

2 

1 

2 

2 

o  -I 

2  2 


x  -  1 
y-l 


If  m  =  n,  the  differential  matrix  is  square  and  its  determinant  is  called  the  Jacobian  of 
F.  The  standard  notation  for  this  determinant  is 


d(fi,f2,...,fn) 


3(xi,x2, 


,x„) 


Ml 

Ml 

dx\ 

dx2 

dx„ 

Ml 

Ml 

dxi 

dx2 

dxn 

Wn_ 

dA 

Ml 

dx\ 

dx2 

dx„ 

We  will  often  write  the  Jacobian  of  F  more  simply  as  /(F),  and  its  value  at  Xo  as  /F(Xo). 

Since  an  n  x  n  matrix  is  nonsingular  if  and  only  if  its  determinant  is  nonzero,  it  follows 
that  if  F  :  W  W  is  differentiable  at  Xo,  then  F'(X0)  is  nonsingular  if  and  only  if 
/F(Xq)  7^  0.  We  will  soon  use  this  important  fact. 


Example  6.2.5  If 


then 


d(xi,x2,x3) 


/F(X) 


x2  -  2x  +  z 
x  +  2xy  +  z.2 
x  +  y  +  z 


2x-2     0  1 
1  +  2y    2x  2z. 
11  1 


(2x  -  2) 


2x  2z 
1  1 


+ 


1  +  2y  2x 
1  1 


=  (2x  -  2){2x  -  2z)  +  (1  +  2y  -  2x). 
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In  particular,  /F(l,  —1,  1)  =  —3,  so  the  differential  matrix 

F'(l,-1,  1)  = 

is  nonsingular. 


0  0  1 
-12  2 

1  1  1 


Properties  of  Differentiable  Transformations 

We  leave  the  proof  of  the  following  theorem  to  you  (Exercise  6.2.16). 

Theorem  6.2.3  IfF  :  R"  — >  M.m  is  differentiable  at  Xo,  then  F  is  continuous  at  Xo. 

Theorem  5.3.10  and  Definition  5.4.1  imply  the  following  theorem. 

Theorem  6.2.4  Let  F  =  (/i,  f2, . . . ,  fm)  :  W  M.m,  and  suppose  that  the  partial 
derivatives 

l<i<m,     l<j<n,  (6.2.7) 

dxj 

exist  on  a  neighborhood  of  Xq  and  are  continuous  at  Xo.  Then  F  is  differentiable  at  Xo- 

We  say  that  F  is  continuously  differentiable  on  a  set  S  if  S  is  contained  in  an  open  set 
on  which  the  partial  derivatives  in  (6.2.7)  are  continuous.  The  next  three  lemmas  give 
properties  of  continuously  differentiable  transformations  that  we  will  need  later. 

Lemma  6.2.5  Suppose  thatF  :  W  —>  M.m  is  continuously  differentiable  on  a  neigh- 
borhood N  o/Xo.  Then,  for  every  €  >  0,  there  is  a  8  >  0  such  that 

|F(X)-F(Y)|  <  (||F'(Xo)||  +e)|X-Y|    if  A,  Y  e  BS(X0).  (6.2.8) 

Proof   Consider  the  auxiliary  function 

G(X)  =  F(X)  -  F'(X0)X.  (6.2.9) 

The  components  of  G  are 

U  x 

so 

3g,-(X)  _  dfj(X)  dfiXo) 
dxj  dxj  dxj 
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Thus,  dgi/dxj  is  continuous  on  N  and  zero  at  Xq.  Therefore,  there  is  a  8  >  0  such  that 


3xy 


< 


for     1  <  i  <  m,     l<j<  /i,    if    |X-  X0|  <  8.  (6.2.10) 


Now  suppose  that  X,  Y  e  fi«(X0).  By  Theorem  5.4.5, 

J=i  0XJ 


(6.2.11) 


where  X;  is  on  the  line  segment  from  X  to  Y,  so  X;  6  B$(Xo).  From  (6.2.10),  (6.2.11), 
and  Schwarz's  inequality, 


(gi(X)-gi(Y))2  <  \J2 


dxj 


~l2  ' 


|X-Y|2  <  —  |X-Y|2. 
m 


Summing  this  from  i  =  1  to  i  =  m  and  taking  square  roots  yields 

|G(X)-G(Y)|  <e|X-Y|    if  X,Ye^(X0). 

To  complete  the  proof,  we  note  that 

F(X)  -  F(Y)  =  G(X)  -  G(Y)  +  F'(X0)(X  -  Y), 

so  (6.2.12)  and  the  triangle  inequality  imply  (6.2.8). 


(6.2.12) 

(6.2.13) 
U 


Lemma  6.2.6  Suppose  that  F  :  W  —>  W  is  continuously  differentiable  on  a  neigh- 
borhood of      andF'(Ko)  is  nonsingular.  Let 


1 


(6.2.14) 


IICF'CXo))-1!! 

Then,  for  every  e  >  0,  there  is  a  8  >  0  such  that 

|F(X)-F(Y)|  >  (r-e)|X-Y|    if  X,  Y  e  BS(X0).  (6.2.15) 

Proof   Let  X  and  Y  be  arbitrary  points  in  DF  and  let  G  be  as  in  (6.2.9).  From  (6.2.13), 

(6.2.16) 


Since 
(6.2.14)  implies  that 


|F(X)-F(Y)|  >  ||F'(X0)(X-Y)|-|G(X)-G(Y)||, 
X-Y=  [F'(Xo)]-1F'(X0)(X-Y). 


|X-Y|  <  -|F'(X0)(X-Y|, 


|F'(X0)(X-Y)|  >  r|X-Y|.  (6.2.17) 
Now  choose  8  >  0  so  that  (6.2.12)  holds.  Then  (6.2.16)  and  (6.2.17)  imply  (6.2.15).  U 
See  Exercise  6.2.19  for  a  stronger  conclusion  in  the  case  where  F  is  linear. 
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Lemma  6.2.7  IfF  :  R"  — ►  M.m  is  continuously  differentiable  on  an  open  set  containing 
a  compact  set  D,  then  there  is  a  constant  M  such  that 

|F(Y)-F(X)|  <  M|Y-X|    if   X,Y  £  D.  (6.2.18) 

Proof  On 

S  =  {(X,  Y)  |  X,  Y  e  D)  c  m2" 

define 

r|F(Y)-F(X)-F'(X)(Y-X)| 
g(X,Y)=\  |Y-X|  '       ^  ' 

(O,  Y  =  X. 

Then  g  is  continuous  for  all  (X,  Y)  in  5  such  that  Y.  We  now  show  that  if  Xo  e  D, 
then 


lim       g(X,Y)  =  0  =  g(Xo,Xo); 

(X,Y)^(X(),Xo) 


(6.2.19) 


that  is,  g  is  also  continuous  at  points  (Xo,  Xo)  in  S. 

Suppose  that  e  >  0  and  Xo  e  D,  Since  the  partial  derivatives  of  f\,  f%,  fm  are 
continuous  on  an  open  set  containing  D,  there  is  a  S  >  0  such  that 


3/,  (Y)  9/HX) 


dxj  dxj 


Imn 


if  X,Ye  BS(X0),  1  <  i  <m,  1  <  j  <  n.  (6.2.20) 


(Note  that  dfi/dxi  is  uniformly  continuous  on  5,5 (Xo)  for  8  sufficiently  small,  from  The- 
orem 5.2.14.)  Applying  Theorem  5.4.5  to /i,  f2  /ffl,we  find  that  if  X,  Y  £  Bg(X0), 

then 

MY)-MX)  =  f2d-^-(yj-xj), 
./=i  3 

where  X,  is  on  the  line  segment  from  X  to  Y.  From  this, 

-1  2 


ftM-fiW-J^-iyj-xj) 

7  =  1  ■/ 


E 


dxj 


dxj 


(y.i-xj) 


\Y-X\2J2 


^— '  I  dxj  dx; 
j  =  i 

(by  Schwarz's  inequality) 


<  — |Y-X| 
m 


(by  (6.2.20)) 

Summing  from  i  =  1  to  (  =  m  and  taking  square  roots  yields 

|F(Y)-F(X)-F'(X)(Y-X)|  <e|Y-X|    if   X,Y  £  BS(X0). 
This  implies  (6.2.19)  and  completes  the  proof  that  g  is  continuous  on  S. 
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Since  D  is  compact,  so  is  S  (Exercise  5.1.27).  Therefore,  g  is  bounded  on  S  (Theo- 
rem 5.2.12);  thus,  for  some  Mi, 

|F(Y)-F(X)-F'(X)(Y-X)|  <Mi|X-Y|    if  XJeO. 

But 

|F(Y)  -  F(X)|  <  |F(Y)  -  F(X)  -  F'(X)(Y  -  X)|  +  |F'(X)(Y  -  X)| 
<(M1  +  ||F(X)||)|(Y-X|. 

Since 

\  1/2 
2  < 


if'(x)ii<(EE 

,!  =  !  7  =  1 


dXj  J 


and  the  partial  derivatives  {3yi/3x7}  are  bounded  on  D,  it  follows  that  ||F'(X)||  is  bounded 
on  D ;  that  is,  there  is  a  constant  M2  such  that 

||F'(X)||  <  M2,  XeD. 

Now  (6.2.21)  implies  (6.2.18)  with  M  =  Mi  +  M2.  TJ 

The  Chain  Rule  for  Transformations 

By  using  differential  matrices,  we  can  write  the  chain  rule  for  transformations  in  a  form 
analogous  to  the  form  of  the  chain  rule  for  real-valued  functions  of  one  variable  (Theo- 
rem 2.3.5). 

Theorem  6.2.8  Suppose  that  F  :  E"  -»  W"  is  differentiable  atX0,  G  :  Rk  W  is 
differentiable  at  Uo,  and  Xo  =  G(Uo).  Then  the  composite  function  H  =  F  o  G  :  Kfe  — > 
Rm,  defined  by 

H(U)  =  F(G(U)), 

is  differentiable  at  Uo  -  Moreover, 

H'(U0)  =  F'(G(Uo))G(U0)  (6.2.22) 

and 

dVotL  =  dXoFo  dVoG,  (6.2.23) 

where  o  denotes  composition. 

Proof   The  components  of  H  are  hi,  h2,  •  •  • ,  hm,  where 

hi(U)  =  /,(6(U)). 
Applying  Theorem  5.4.3  to  hi  yields 

dVohi  =  ^2  ^    °  dVogj,     1  <  i  <  m.  (6.2.24) 

7=1  J 
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Since 


dVoH 


dVohi 
dVoh2 


and    d\jQ  G 


the  m  equations  in  (6.2.24)  can  be  written  in  matrix  form  as 


d\]0g\ 
dv0gi 

dv0gn 


But 

where 


dVoll  =  F'(X0)dVoG  =  F'(G(U0))^u0G. 
dVoG  =  G'(VQ)dV, 


(6.2.25) 


dU  = 


du\ 

du-i 

diik 


so  (6.2.25)  can  be  rewritten  as 

rfu0H  =  F'(G(Uo))G'(Uo)^U. 

On  the  other  hand, 

dVofl  =  H'(U0)<iU. 

Comparing  the  last  two  equations  yields  (6.2.22).  Since  G'(Uo)  is  the  matrix  of  d\j0G  and 
F'(G(U0))  =  F'(X0)  is  the  matrix  of  dXoF,  Theorem  6.1.7(c)  and  (6.2.22)  imply  (6.2.23). 

n 


Example  6.2.6  LetU0  =  (1,-1). 
G(U)  =  G(m,  »)  =      Vw2  +  3i;2 


F(X)  =F(x,y,z)  = 


x2  +  y2  +  2z' 


2  2 

x  —  y 


and 

H(U)  =  F(G(U)). 
Since  G  is  differentiable  at  Uo  =  (1,-1)  and  F  is  differentiable  at 

X0  =  G(U0)  =  (1,2,1), 

Theorem  6.2.8  implies  that  H  is  differentiable  at  (1,  —1).  To  find  H'(l,  —1)  from  (6.2.22), 
we  first  find  that 
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G'(U)  = 


1 

2^/u 
u 


0 

3v 


Vw2  +  3u2     Vw2  +  3u2 
1 

0 


2V^  +  2  - 


and 


Then,  from  (6.2.22), 


F'(X)  = 


2x  2y  4z 
2x    -2y  0 


2  4  4 
2-4  0 


-1) 

-  1 

2 

0 

1 

3 

2 

2 

0 

1 

2  - 

3  -4 
-1  6 


We  can  check  this  by  expressing  H  directly  in  terms  of  (u,  v)  as 

,  2 


H(m,  i;) 


(V^)2  -  (vM2  +  3u2)2 


u  +  u2  +  3v2  +  2v  +  4 
u  —  u2  —  3v2 


and  differentiating  to  obtain 


which  yields 


as  we  saw  before. 


H>,  u)  = 
H'(l,-1)  = 


1+2m  6u  +  2 
1  —  2m      — 6i> 


3  -4 
-1  6 


6.2  Exercises 


1.     Show  that  the  following  definitions  are  equivalent. 

(a)   F  =  (/i ,  fz, fm)  is  continuous  at  Xo  if  f\ ,  /2,  . . . ,  /m  are  continuous 
atX0. 
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2. 


3. 

4. 
5. 

6. 
7. 


8. 
9. 


(b)   F  is  continuous  at  Xo  if  for  every  e  >  0  there  is  a  5  >  0  such  that  |F(X) 
F(X0)|  <  e  if  |X  -  X0|  <  S  and  X  e  DF. 

Verify  that 

F(X)  -  F(X0)  -  F'(X0)(X  -  Xo)  n 

hm   ■  ■   =  0. 

x^x0  X-X0 


(a)  F(X)  = 

(b)  F(X)  = 

(c)  F(X)  = 


3x  +  Ay 
2x  —  y 
x  +  y 


X0  =  (x0,  y0,Zo) 


2x2  +  xy  +  1 

xy 
x2  +  y2 


sin(x  +  y) 
sin(j  +  z) 
sin(x  +  z) 


Xo  =  (1,-1) 


X0  =  (tt/4,  0,  jt/4) 


Suppose  that  F  :  R"  Rm  and  /j  :  R"  — >■  R  have  the  same  domain  and  are 
continuous  at  Xo.  Show  that  the  product  h¥  =  (hfi ,  hfz, . . . ,  hfm)  is  continuous  at 
X0. 

Suppose  that  F  and  G  are  transformations  from  R"  to  Rm  with  common  domain  D . 
Show  that  if  F  and  G  are  continuous  at  Xo  e  D,  then  so  are  F  +  G  and  F  —  G. 

Suppose  that  F  :  R"  — ►  Rm  is  defined  in  a  neighborhood  of  Xo  and  continuous  at 
X0,  G  :  Rk  -+  R"  is  defined  in  a  neighborhood  of  Uo  and  continuous  at  Uo,  and 
Xo  =  G(Uo).  Prove  that  the  composite  function  H  =  F  o  G  is  continuous  at  Uo- 

Prove:  If  F  :  R"      Rm  is  continuous  on  a  set  S,  then  |F|  is  continuous  on  S. 

Prove:  If  F  :  R"  — >  Rm  is  continuous  on  a  compact  set  S,  then  |F|  is  bounded  on 
S,  and  there  are  points  Xo  and  Xi  in  S  such  that 

|F(X0)|  <  |F(X)|  <  |F(XO|,  XeS; 

that  is,  |F|  attains  its  infimum  and  supremum  on  5.  Hint:  Use  Exercise  6.2.6. 

Prove  that  a  linear  transformation  L  :  R"  — >•  Rm  is  continuous  on  R" .  Do  not  use 
Theorem  6.2.8. 

Let  A  be  an  m  x  n  matrix. 

(a)    Use  Exercises  6.2.7  and  6.2.8  to  show  that  the  quantitites 


M(A)  =  max 


|AX] 
IXI 


X  ^  0|     and     m(A)  =  min 


IAXI 


X^O 


exist.  Hint:  Consider  the  functionL(Y)  =  AY  on  S  =  {Y  |  |Y|  =  1} . 
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(b)  Show  that  M(A)  =  ||A||. 

(c)  Prove:  If  n  >  m  or  n  =  m  and  A  is  singular,  then  m(A)  =  0.  (This  requires  a 
result  from  linear  algebra  on  the  existence  of  nontrivial  solutions  of  AX  =  0.) 

(d)  Prove:  If  n  =  m  and  A  is  nonsingular,  then 


m(A)M(A_1)  =  m(A~1)M(A)  =  1. 

10.  We  say  that  F  :  W  ->  Rm  is  uniformly  continuous  on  S  if  each  of  its  components 
is  uniformly  continuous  on  S.  Prove:  If  F  is  uniformly  continuous  on  S,  then  for 
each  €  >  0  there  is  a  S  >  0  such  that 


|F(X)-F(Y)|  <e 


if    |X  -  Y|  <  S    and    X,  Y  e  S. 

11 

in  W. 

12.     Find  F'  and  /F.  Then  find  an  affine  transformation  G  such  that 


Show  that  if  F  is  continuous  on  W  and  F(X  +  Y)  =  F(X)  +  F(Y)  for  all  X  and  Y 
in  W",  then  A  is  linear.  HINT:  The  rational  numbers  are  dense  in  the  reals. 


lim  F(X)'G(Y)  =  0. 
x^Xo     X  —  Xq 


(a)  F(x,y,z)  = 

(b)  ¥(x,y)  = 

(c)  ¥(x,y,z)  = 

13.  FindF'. 

(a)  F(x,y,z)  = 

(c)  ¥(x,y,z)  = 

14.  Find  F'  and  /F. 

(a)  F(r,fl) 

(c)  F(r,  6,  z.)  = 


x2  +  y  +  2z 
cos(x  +  y  +  z) 


„xyz 


e  cos  y 
ex  sin  y 

2  2 

xA  —  yA 

2  2 

yz  -  z 

2  2 

z  -  XA 


Xq  =  (1,-1,0) 
Xo  =  (0,  it/2) 
Xo  =  (1,1,1) 


(x  +  y  +  z)ex 
(x2  +  y2)e~x 


e  sin  yz, 
ey  sinxz 
ez  sinxj 


(b)  F(x) 


g2(x) 
gn(x) 


r  cos  6 
r  sin  (9 

r  cos  6 
r  sin  (9 

z 


(b)  F(r,  9,  <p) 


r  cos  6  cos  (p 
r  sin  6  cos  (p 
r  sin0 
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15.     Prove:  If  Gi  and  G2  are  affine  transformations  and 

Gj(X)-G2(Y) 
lim   ■  ■        =  0, 


x^-Xo      IX  —  Xq 


thenGi  =  G2. 
1 6 .     Prove  Theorem  6 . 2 . 3 . 


is  differentiable  at  Xq  and  e  >  0,  there  is  a  S  >  0  such 


17.  Show  that  if  F  :  W  -> 
that 

|F(X)-F(X0)|  <  (||F'(Xo)||  +e)|X-X0|    if  |X-X0|<<5. 
Compare  this  with  Lemma  6.2.5. 

18.  Suppose  that  F  :  W  ->•  R"  is  differentiable  at  X0  and  F'(X0)  is  nonsingular.  Let 

1 


HtF'CXo)]-1 1| 

and  suppose  that  e  >  0.  Show  that  there  is  a  S  >  0  such  that 

|F(X)-F(X0)|  >  (r-e)|X-X0|    if    |X-X0|  <  5. 
Compare  this  with  Lemma  6.2.6. 
19.     Prove:  If  L  :  W      Km  is  defined  by  L(X)  =  A(X),  where  A  is  nonsingular,  then 


|L(X)-L(Y)|  > 


1 


IX-YI 


for  all  X  and  Y  in  R". 

20.     Use  Theorem  6.2.8  to  find  H'(U0),  where  H(U)  =  F(G(U).  Check  your  results  by 
expressing  H  directly  in  terms  of  U  and  differentiating. 


x2  +  y2  n 


(a)  F(x,y,z)  = 
(x/2,n/2,2) 

(b)  F(x,y)  = 

(c)  F(x,y,z)  = 
Uo  arbitrai-y 


x2  +  y2 


G(w,  v,  w) 


w  cos  u  sin  t; 
w  sin  u  sin  v 
w  cos  v 


U0 


2  2 
x  —  y 

y 

x 


G(u,  v) 


3x  +  4y  +  2z.  +  6 
Ax  -  2y  +  z-l 
-x  +   y  +   z.  —  2 


v  cos  u 
v  sinw 

G(u,  v)  = 


Uo  =  (jr/4,  3) 


M  —  V 
U  +  V 

u  —  2v 
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21. 


22. 


23. 


(d)  F(x,y)  = 

(e)  ¥(x,y)  = 

(f)  F(x,j)  = 


x  +  .y 

X- J 


^2  _|_  ^2 

X2 


3" 


x  +  2y 

2 

x  —  y 


x2  +  y 


G(u,  v,  w)  = 
G(w,  v)  = 

G(w,  v)  = 


2u  —  v  +  w 

2  2 


e  cos  v 
eu  sin  v 


U0  =  (1,1,-2) 
U0  =  (0,  0) 


u  +  2v 


2u 


d(huh 


2,  •  •  • 


U0  =  (1,-2) 
with  values  in  W ,  and 

hn)        3(/l,  f%,  ■  ■  ■  ,  fn)  d(gU  g2,  ■  ■  ■ ,  gn) 


Suppose  that  F  and  G  are  continuously  differentiable  on 
let  H  =  F  o  G.  Show  that 


d(ui,u2, . . .  ,u„)      d(xi,x2, . . .  ,xn)  d(ui,u2,  ■  ■  -,u„) 
Where  should  these  Jacobians  be  evaluated? 

Suppose  that  F  :  R"  — >  M.m  and  X  is  a  limit  point  of  £)p  contained  in  D^.  Show 
that  F  is  continuous  at  X  if  and  only  if  lim^oo  F(X^)  =  F(X)  whenever  {X^}  is  a 
sequence  of  points  in  Z)p  such  that  lim^oo      =  X.  Hint:  See  Exercise  5.2.15. 

Suppose  that  F  :  R"  M.m  is  continuous  on  a  compact  subset  S  of  M" .  Show  that 
F(S)  is  a  compact  subset  of  M.m. 


6.3  THE  INVERSE  FUNCTION  THEOREM 


So  far  our  discussion  of  transformations  has  dealt  mainly  with  properties  that  could  just  as 
well  be  defined  and  studied  by  considering  the  component  functions  individually.  Now  we 
turn  to  questions  involving  a  transformation  as  a  whole,  that  cannot  be  studied  by  regarding 
it  as  a  collection  of  independent  component  functions. 

In  this  section  we  restrict  our  attention  to  transformations  from  W  to  itself.  It  is  useful 
to  interpret  such  transformations  geometrically.  If  F  =  (/i,  f2, . . . ,  fn),  we  can  think  of 
the  components  of 

F(X)  =  (/1(X)./2(X),...,/„(X)) 

as  the  coordinates  of  a  point  U  =  F(X)  in  another  "copy"  of  M".  Thus,U  =  u2, . . . ,  un), 
with 

Ml  =  MX),      U2  =  f2(X),  Un=fn(X). 

We  say  that  F  maps  X  to  U,  and  that  U  is  the  image  ofX  under  F.  Occasionally  we  will 
also  write  dui/dxj  to  mean  dfi/dxj.  If  S  C  D$,  then  the  set 

F(S)  =  {U|U  =  F(X),  X  e  S} 

is  the  image  of  S  under  F. 

We  will  often  denote  the  components  of  X  by  x,  y,  . . . ,  and  the  components  of  U  by  u, 
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Example  6.3.1  If 


9  9 

x  +  y 

2  2 

x  —  y 


then 
and 


=  F(x,  y)  = 

"  =  fi(x,  y)  =  x2  +  y2,     v  =  f2(x,  y)  =  x2  -  y2, 


dfi(x,y)                              dfi(x,y)  , 
ux(x,  y)  =   =  2x,      uy(x,  y)  =   =  2y, 


3x 


dy 


,     dfz(x,y)  dfz(x,y) 
vx(x,  y)  =   —       =  2x,       vy(x,  y)  =   —       =  -2y. 


dx 


dy 


To  find  F(K2),  we  observe  that 


u  +  v  =  2x  ,     u  -v  =  2y  , 


so 


F(M2)  C  T  =  {(u,  v)  |  u  +  v  >  0,  u  -  v  >  0} , 
which  is  the  part  of  the  wu-plane  shaded  in  Figure  6.3.1.  If  (u,  v)  e  T,  then 

( -s/u  +  v   yju  —  v 


soF(R2)  =  T. 


u  —  v  =  0 


u  +  v  =  0 


Figure  6.3.1 
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Invertible  Transformations 


A  transformation  F  is  one-to-one,  or  invertible,  if  F(Xi)  and  F(X2)  are  distinct  whenever 
Xi  and  X2  are  distinct  points  of  DF.  In  this  case,  we  can  define  a  function  G  on  the  range 

R(F)  =  {U  I  U  =  F(X)  for  some  X  e  DF} 

of  F  by  defining  G(U)  to  be  the  unique  point  in  DF  such  that  F(U)  =  U.  Then 

Dq  =  R(¥)    and    R(G)  =  DF. 

Moreover,  G  is  one-to-one, 

G(F(X))  =  X.    X  e  DF, 

and 

F(G(U))  =  U.    U  €  DG. 

We  say  that  G  is  the  inverse  of  F,  and  write  G  =  F_1 .  The  relation  between  F  and  G  is 
symmetric;  that  is,  F  is  also  the  inverse  of  G,  and  we  write  F  =  G_1 . 

Example  6.3.2  The  linear  transformation 

V 

maps  (x,  y)  to  (u,  v),  where 


x-y 
x  +  y 


(6.3.1) 


(6.3.2) 


u  =  x  —  y, 
v  =  x  +  y. 

L  is  one-to-one  and  R(L)  =  R2,  since  for  each  (m,  v)  in  R2  there  is  exactly  one  (x,  y) 
such  that  L(x,  y)  =  (u,  v).  This  is  so  because  the  system  (6.3.2)  can  be  solved  uniquely 
for  (x,  y)  in  terms  of  (u,  v): 

x 


y 


j(u  +  v), 
\{-u  +  v). 


(6.3.3) 


Thus, 


v  ,  j  2 


U  +  V 
-u  +  V 


Example  6.3.3  The  linear  transformation 

"     =h1(x,y)  = 
maps  (x,  y)  onto  (it,  v),  where 

u  =  x  +  y, 
v  =  2x  +  2y, 


x  +  y 
2x  +  2y 


(6.3.4) 


Section  6.3  The  Inverse  Function  Theorem  397 


Li  is  not  one-to-one,  since  every  point  on  the  line 

x  +  y  =  c  (constant) 
is  mapped  onto  the  single  point  (c,  2c).  Hence,  Li  does  not  have  an  inverse.  ■ 

The  crucial  difference  between  the  transformations  of  Examples  6.3.2  and  6.3.3  is  that 
the  matrix  of  L  is  nonsingular  while  the  matrix  of  Li  is  singular.  Thus,  L  (see  (6.3. 1))  can 
be  written  as 


u 

1 

-1  " 

X 

V 

1 

1 

.  y  . 

(6.3.5) 


where  the  matrix  has  the  inverse 


_i_  l 

2  2 


'2  2 


(Verify.)  Multiplying  both  sides  of  (6.3.5)  by  this  matrix  yields 


1 

1 

2 

2 

U 

X 

1 

1 

V 

y 

2 

2  J 

which  is  equivalent  to  (6.3.3). 
Since  the  matrix 


1  1 

2  2 


of  Li  is  singular,  (6.3.4)  cannot  be  solved  uniquely  for  (x,  y)  in  terms  of  (u,  v).  In  fact,  it 
cannot  be  solved  at  all  unless  v  =  2u. 

The  following  theorem  settles  the  question  of  invertibility  of  linear  transformations  from 
R"  to  W.  We  leave  the  proof  to  you  (Exercise  6.3.2). 

Theorem  6.3.1  The  linear  transformation 

U  =  L(X)  =  AX    (Rn  E") 

is  invertible  if  and  only  if  A  is  nonsingular,  in  which  case  i?(L)  =  W  and 

L_1(U)  =  A_1U. 


Polar  Coordinates 

We  will  now  briefly  review  polar  coordinates,  which  we  will  use  in  some  of  the  following 
examples. 

The  coordinates  of  any  point  (x,  y)  can  be  written  in  infinitely  many  ways  as 

x  =  r  cos  6,     y  =  rsin6,  (6.3.6) 
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where 


and,  if  r  >  0,  9  is  the  angle  from  the  x-axis  to  the  line  segment  from  (0,  0)  to  (x,  y), 
measured  counterclockwise  (Figure  6.3.2). 


For  each  (x,y)  ^  (0,0)  there  are  infinitely  many  values  of  9,  differing  by  integral 
multiples  of  2tt,  that  satisfy  (6.3.6).  If  9  is  any  of  these  values,  we  say  that  9  is  an  argument 
of  (x,  y),  and  write 


By  itself,  this  does  not  define  a  function.  However,  if  <p  is  an  arbitrary  fixed  number,  then 


does  define  a  function,  since  every  half-open  interval  [</>,  (p  +  2n)  contains  exactly  one 
argument  of  (x,  y). 

We  do  not  define  arg(0,  0),  since  (6.3.6)  places  no  restriction  on  9  if  (x,  y)  =  (0,  0)  and 
therefore  r  =  0. 

The  transformation 


y 


Figure  6.3.2 


9  =  aig(x,y). 


9  =  arg(x,y),     4><9<(p  +  2jr, 


r 


^/xTTyI 


9 


=  G(x,y)  = 


<P  —  nrgi*'  y)  <  <P  +  27T, 


is  defined  and  one-to-one  on 


DG  =  {(x,y)\(x,y)^(Q,Q)}, 


and  its  range  is 


R(G)  =  {(r,  9)\r  >0,(j)  <9  <(j)  +2ti}. 
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For  example,  if  (p  =  0,  then 


G(l,  1) 


V2 

1 


since  it/4  is  the  unique  argument  of  (1,  1)  in  [0,  2tc).  If  <p  =  jt,  then 


G(l,l) 


V2 

9n 
L  ~4~ 


since  9it/4  is  the  unique  argument  of  (1,  1)  in  [n,  3n). 

If  arg(^o,  yo)  =  </>,  then  (xq,  yo)  is  on  the  half-line  shown  in  Figure  6.3.3  and  G  is 
not  continuous  at  (xq,  yo),  since  every  neighborhood  of  (xq,  yo)  contains  points  {x,  y)  for 
which  the  second  component  of  G(x,  y)  is  arbitrarily  close  to  (p  +  2tc,  while  the  second 
component  of  G(xq,  yo)  is  </>.  We  will  show  later,  however,  that  G  is  continuous,  in  fact, 
continuously  differentiable,  on  the  plane  with  this  half-line  deleted. 


y 


Figure  6.3.3 


Local  Invertibility 

A  transformation  F  may  fail  to  be  one-to-one,  but  be  one-to-one  on  a  subset  S  of  By 
this  we  mean  that  F(Xi)  and  F(X2)  are  distinct  whenever  Xi  and  X2  are  distinct  points  of 
S.  In  this  case,  F  is  not  invertible,  but  if  Fs  is  defined  on  S  by 

F5(X)  =  F(X),     X  e  S, 

and  left  undefined  for  X  $  S,  then  F^  is  invertible.  We  say  that  F5  is  the  restriction  o/F 
to  S,  and  that  F^1  is  the  inverse  o/F  restricted  to  S.  The  domain  of  F^1  is  F(S). 


400    Chapter  6  Vector-Valued  Functions  of  Several  Variables 


If  F  is  one-to-one  on  a  neighborhood  of  Xo,  we  say  that  F  is  locally  invertible  at  Xo.  If 
this  is  true  for  every  Xq  in  a  set  S,  then  F  is  locally  invertible  on  S . 


Example  6.3.4  The  transformation 

u  1  =F(x,y)  = 
v 

is  not  one-to-one,  since 


2  2 

x  —  y 
2xy 


F(-x,-y)=F(x,y). 


(6.3.7) 


(6.3.8) 


It  is  one-to-one  on  S  if  and  only  if  S  does  not  contain  any  pair  of  distinct  points  of  the  form 
(xo,  Vo)  and  (— xo,  —Jo);  (6.3.8)  implies  the  necessity  of  this  condition,  and  its  sufficiency 
follows  from  the  fact  that  if 

F(xi,  ji)  =  F(x0,  yo), 


then 

{x\,y\)  =  (*o,  yo)    or    (xi,y{)  =  (-x0,  -yo)- 
To  see  this,  suppose  that  (6.3.9)  holds;  then 


yi  = 


2  2 


and 


xiyi  =  x0y0. 

Squaring  both  sides  of  (6.3.1 1)  yields 

x\  -  2x\y\  +  y\  =  x\  -  2x20y%  +  y40. 
This  and  (6.3.12)  imply  that 


(6.3.9) 
(6.3.10) 

(6.3.11) 
(6.3.12) 


4444 

*i  -  *o  =  Jo  -  y\ ■ 


From  (6.3.11), 
Factoring  (6.3.13)  yields 


2      2  _  2 

x\   xo  —  y\ 


yl 


(6.3.13) 
(6.3.14) 


{x\  -  xl)^  +  x20)  =  {yl  -  y2)(y2  +  y2). 
If  either  side  of  (6.3.14)  is  nonzero,  we  can  cancel  to  obtain 

9  9  9  9 

xf  +  x2  =  -yi-yt, 

which  implies  that  xq  =  X\  =  yo  =  y\  =  0,  so  (6.3.10)  holds  in  this  case.  On  the  other 
hand,  if  both  sides  of  (6.3.14)  are  zero,  then 

xi  =  ±x0,    yi  =  ±yo- 

From  (6.3.12),  the  same  sign  must  be  chosen  in  these  equalities,  which  proves  that  (6.3.8) 
implies  (6.3.10)  in  this  case  also. 
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We  now  see,  for  example,  that  F  is  one-to-one  on  every  set  S  of  the  form 

S  =  {(x,  y)  |  ax  +  by  >  0}  , 

where  a  and  b  are  constants,  not  both  zero.  Geometrically,  S  is  an  open  half-plane;  that  is, 
the  set  of  points  on  one  side  of,  but  not  on,  the  line 

ax  +  by  =  0 

(Figure  6.3.4).  Therefore,  F  is  locally  invertible  at  every  Xo  ^  (0,  0),  since  every  such 
point  lies  in  a  half-plane  of  this  form.  However,  F  is  not  locally  invertible  at  (0,  0).  (Why 
not?)  Thus,  F  is  locally  invertible  on  the  entire  plane  with  (0,  0)  removed. 


Figure  6.3.4 

It  is  instructive  to  find  F^1  for  a  specific  choice  of  5.  Suppose  that  S  is  the  open  right 
half-plane: 

5  =  {(x,  y)  |  x  >  0} .  (6.3.15) 

Then  F(5)  is  the  entire  wu-plane  except  for  the  nonpositive  it  axis.  To  see  this,  note  that 
every  point  in  5  can  be  written  in  polar  coordinates  as 

n  n 

x  =  r  cos  9,     y  =  r  sin  (9,     r>0,  <  9  <  — . 

2  2 

Therefore,  from  (6.3.7),  F(x,  y)  has  coordinates  (w,  v),  where 

u  =  x2  -  y2  =  r2(cos2  9  -  sin2  9)  =  r2  cos  29, 
v  =  2xy  =  2r2  cos  9  sin  9  =  r  sin  29. 
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Every  point  in  the  wu-plane  can  be  written  in  polar  coordinates  as 

u  =  pcosa,     v  =  psina, 

where  either  p  =  0  or 

p  =  y/ u2  +  v2  >  0,    —  it  <  a  <  it, 

and  the  points  for  which  p  =  Oora  =  —n  are  of  the  form  (w,  0),  with  u  <  0  (Figure  6.3.5). 
If  (u,  v)  =  ¥(x,  y)  for  some  (x,  y)  in  S,  then  (6.3.15)  implies  that  p  >  0  and  —it  <  a  < 
it.  Conversely,  any  point  in  the  wu-plane  with  polar  coordinates  (p,a)  satisfying  these 
conditions  is  the  image  under  F  of  the  point 


(x,y)  =  (p1/2cosa/2.p1/zsino'/2)  e  S 


1/2, 


Thus, 


¥sHu,v)  = 


(u2  +  v2)1'4  cos(arg(w,  v)/2) 
(u2  +  U2)1/4  sin(arg(w,  v)/2 


-%  <  arg(w,  v)  <  jr. 


Figure  6.3.5 

Because  of  (6.3.8),  F  also  maps  the  open  left  half-plane 

Si  =  {(x,y)\x  <0} 

onto  F(S),  and 

(u2  +  v2)1!4  cos(arg(w,  v)/2) 

(u2  +  v2)1!4  sin(arg(w,  v)/2) 
=  -¥-s\u,v). 


F-Sl(u,v)  = 


7i  <  arg(w,  d)  <  3ti, 
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Example  6.3.5  The  transformation 

"  1  =F(je,y) 
v 

is  not  one-to-one,  since 


e  cos  y 
ex  sin  y 


F(x,j  +  2kn)  =  ¥(x,y) 


(6.3.16) 


(6.3.17) 


if  k  is  any  integer.  This  transformation  is  one-to-one  on  a  set  S  if  and  only  if  S  does  not 
contain  any  pair  of  points  (xo,  yo)  and  (xq,  yo  +  2kn),  where  k  is  a  nonzero  integer.  This 
condition  is  necessary  because  of  (6.3.17);  we  leave  it  to  you  to  show  that  it  is  sufficient 
(Exercise  6.3.8).  Therefore,  for  example,  F  is  one-to-one  on 


S</>  =  {(x,  y)  \  —  oo  <  x  <  oo,  <f>  <  y  <  4>  +  2it} 


(6.3.18) 


where  <p  is  arbitrary.  Geometrically,  5^  is  the  infinite  strip  bounded  by  the  lines  y  =  </>  and 
y  =  <p  +  2jt.  The  lower  boundary  is  in  S^,  but  the  upper  is  not  (Figure  6.3.6).  Since  every 
point  is  in  the  interior  of  some  such  strip,  F  is  locally  invertible  on  the  entire  plane. 


y 


Figure  6.3.6 

The  range  of  Fs^  is  the  entire  wu-plane  except  the  origin,  since  if  (u,  v)  ^  (0,  0),  then 
(u,v)  can  be  written  uniquely  as 


u 

pcosa 

V 

psina 

where 

p>0,    </>  <  a  <  cj>  +  2tt, 
so  (w,  v)  is  the  image  under  F  of 

(x,  y)  =  (logp.a)  e  5. 

The  origin  is  not  in  R(F),  since 

|F(x,  y)\2  =  (excosy)2  +  (ex  sin y  f  =  e2x  ^  0. 
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Fsl(u,v)  = 


4>  <  arg(w,  v)  <  (p  +  2jt. 


Finally, 

~  l0g(M2  +  V2)1'2 

arg(w,  i>) 

The  domain  of  F^*  is  the  entire  wu-plane  except  for  (0,  0). 
Regular  Transformations 

The  question  of  invertibility  of  an  arbitrary  transformation  F  :  R"  R"  is  too  general  to 
have  a  useful  answer.  However,  there  is  a  useful  and  easily  applicable  sufficient  condition 
which  implies  that  one-to-one  restrictions  of  continuously  differentiable  transformations 
have  continuously  differentiable  inverses. 

To  motivate  our  study  of  this  question,  let  us  first  consider  the  linear  transformation 


F(X)  =  AX  = 


an  ai2 

#21  a22 


0-2n 


X2 


From  Theorem  6.3.1,  F  is  invertible  if  and  only  if  A  is  nonsingular,  in  which  case  R(F)  = 
W  and 

F_1(U)  =  A_1U. 

Since  A  and  A-1  are  the  differential  matrices  of  F  and  F_1,  respectively,  we  can  say  that  a 
linear  transformation  is  invertible  if  and  only  if  its  differential  matrix  F'  is  nonsingular,  in 
which  case  the  differential  matrix  of  F_1  is  given  by 

(F-1/ =  (ft1. 

Because  of  this,  it  is  tempting  to  conjecture  that  if  F  :  R"  — >  R"  is  continuously  differen- 
tiable and  A'(X)  is  nonsingular,  or,  equivalently,  /F(X)  ^  0,  for  X  in  a  set  S,  then  F  is 
one-to-one  on  S.  However,  this  is  false.  For  example,  if 


F(x,j)  = 


then 


e  cos  y 
ex  siny 


e  cos  y 
ex  sin  y 

—ex  siny 
ex  cos  y 


=  elx  0, 


(6.3.19) 


but  F  is  not  one-to-one  on  R2  (Example  6.3.5).  The  best  that  can  be  said  in  general  is 
that  if  F  is  continuously  differentiable  and  /F(X)  ^  0  in  an  open  set  5,  then  F  is  locally 
invertible  on  S,  and  the  local  inverses  are  continuously  differentiable.  This  is  part  of  the 
inverse  function  theorem,  which  we  will  prove  presently.  First,  we  need  the  following 
definition. 
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Definition  6.3.2  A  transformation  F  :  R"  — >  R"  is  regular  on  an  open  set  S  if  F  is 
one-to-one  and  continuously  differentiable  on  5,  and  /F(X)  ^  0  if  X  e  5.  We  will  also 
say  that  F  is  regular  on  an  arbitrary  set  S  if  F  is  regular  on  an  open  set  containing  S. 


Example  6.3.6  If 

(Example  6.3.2),  then 


mx,y) 


x-y 
x  +  y 

1  -1 
1  1 


=  2, 


so  F  is  one-to-one  on  R2.  Hence,  F  is  regular  on  R2. 

x  +  y 


(Example  6.3.3),  then 


F(*,  y) 


J¥(x,y) 


so  F  is  not  regular  on  any  subset  of  R2. 
If 


2x  +  2y 


1  1 

2  2 


2  2 

x  —  y 
2xy 


(Example  6.3.4),  then 


J¥(x,y)  = 


2x  —2y 
2y  2x 


=  2{x2  +  y2), 


so  F  is  regular  on  any  open  set  S  on  which  F  is  one-to-one,  provided  that  (0,  0)  $  S.  For  ex- 
ample, F  is  regular  on  the  open  half-plane  {(x,  y)\x>  0},  since  we  saw  in  Example  6.3.4 
that  F  is  one-to-one  on  this  half-plane. 

If 

r  ex  cos  y 


e  cos  y 


(Example  6.3.5),  then  JF(x,  y)  =  e2x  (see  (6.3.19)),  so  F  is  regular  on  any  open  set  on 
which  it  is  one-to-one.  The  interior  of     in  (6.3. 18)  is  an  example  of  such  a  set. 


Theorem  6.3.3  Suppose  that  F  :  R"  — >  R"  is  regular  on  an  open  set  S,  and  let 
G  =  F7  .  Then  F(S)  is  open,  G  is  continuously  differentiable  on  F(5),  and 

G'(U)  =  (F'(X))-\     where    U  =  F(X). 

Moreover,  since  G  is  one-to-one  on  F(S),  G  is  regular  on  F(S). 
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Proof  We  first  show  that  if  X0  €  S,  then  a  neighborhood  of  F(X0)  is  in  F(S).  This 
implies  that  F(S)  is  open. 

Since  S  is  open,  there  is  a  p  >  0  such  that  Bp(Xo)  C  S.  Let  B  be  the  boundary  of 
5p(X0);thus, 

B  =  {\X}  |X-X0|  =  p.  (6.3.20) 

The  function 

a(X)=  |F(X)-F(X0)| 

is  continuous  on  S  and  therefore  on  B,  which  is  compact.  Hence,  by  Theorem  5.2.12,  there 
is  a  point  Xi  in  B  where  ct(X)  attains  its  minimum  value,  say  m,  on  B.  Moreover,  m  >  0, 
since  Xi  ^  Xo  and  F  is  one-to-one  on  S.  Therefore, 

|F(X)  -F(X0)|  >  m  >  0    if    \X-X0\=p.  (6.3.21) 

The  set 

{U||U-F(X0)|  <m/2} 

is  a  neighborhood  of  F(Xo).  We  will  show  that  it  is  a  subset  of  F(S).  To  see  this,  let  U  be 
a  fixed  point  in  this  set;  thus, 

|U-F(X0)|  <m/2.  (6.3.22) 

Consider  the  function 

C7!(X)  =  |U-F(X)|2, 
which  is  continuous  on  S.  Note  that 

2 

cti(X)>^-    if    \X-Xo\=p,  (6.3.23) 

since  if  |X  —  Xo|  =  p,  then 

|U-F(X)|  =  |(U-F(Xo))  +  (F(X0)-F(X))| 

>  ||F(Xo)-F(X)|-|U-F(X0)|| 

m  m 

>  m  =  — , 

2  2 

from  (6.3.21)  and  (6.3.22). 

Since  a\  is  continuous  on  S,  o\  attains  a  minimum  value  [i  on  the  compact  set  Bp(Xo) 
(Theorem  5.2.12);  that  is,  there  is  an  X  in  Bp(Xq)  such  that 


(71  (X)  >(7!(X)  =fl,  X€Bp(X0). 

Setting  X  =  Xo,  we  conclude  from  this  and  (6.3.22)  that 

2 

—  m 
o-i  (X)  =  \L  <  CTi(Xo)  <  — . 

Because  of  (6.3.20)  and  (6.3.23),  this  rules  out  the  possibility  that  X  e  B,  so  X  e  Bp(Xq). 
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Now  we  want  to  show  that  n  =  0;  that  is,  U  =  F(X).  To  this  end,  we  note  that  <7i(X) 
can  be  written  as 

n 

<ri(X)  =       (uJ-fj  (X))2- 

j  =  l 

so  o\  is  differentiable  on  Bp(Xo).  Therefore,  the  first  partial  derivatives  of  a\  are  all  zero 
at  the  local  minimum  point  X  (Theorem  5.3.1 1),  so 

E^(»/-//(X))=0,  !<<<», 
j=i  1 

or,  in  matrix  form, 

F'(X)(U-F(X))  =  0. 

Since  F'(X)  is  nonsingular  this  implies  that  U  =  F(X)  (Theorem  6.1.13).  Thus,  we  have 
shown  that  every  U  that  satisfies  (6.3.22)  is  in  F(5).  Therefore,  since  Xo  is  an  arbitrary 
point  of  S,  F(S)  is  open. 

Next,  we  show  that  G  is  continuous  on  F(S).  Suppose  that  Uo  €  F(S)  and  Xo  is  the 
unique  point  in  S  such  that  F(Xo)  =  Uo-  Since  F'(Xo)  is  invertible,  Lemma  6.2.6  implies 
that  there  is  a  A  >  0  and  an  open  neighborhood  N  of  Xo  such  that  N  C  S  and 

|F(X)-F(X0)|  >  A|X-X0|    if   X  €  N.  (6.3.24) 

(Exercise  6.2.18  also  implies  this.)  Since  F  satisfies  the  hypotheses  of  the  present  theorem 
on  N,  the  first  part  of  this  proof  shows  that  F(N)  is  an  open  set  containing  Uo  =  F(Xo). 
Therefore,  there  is  a  8  >  0  such  thatX  =  G(U)  is  in  N  if  U  €  Bg(V0).  Setting  X  =  G(U) 
and  X0  =  G(U0)  in  (6.3.24)  yields 

|F(G(U))  -  F(G(U0))|  >  A|G(U)  -  G(U0)|    if    U  e  2>4(C0). 
Since  F(G(U))  =  U,  this  can  be  rewritten  as 

|G(U)-G(U0)|  <-|U-U0|    if   UeBi(Uo),  (6.3.25) 

which  means  that  G  is  continuous  at  Uo-  Since  Uo  is  an  arbitrary  point  in  F(5),  it  follows 
that  G  is  continous  on  F(S). 

We  will  now  show  that  G  is  differentiable  at  Uo-  Since 

G(F(X))  =  X,    X  €  S, 
the  chain  rule  (Theorem  6.2.8)  implies  that G  is  differentiable  at  Uo,  then 


G'(Uo)F'(X0)  =  I 
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(Example  6.2.3).  Therefore,  if  G  is  differentiable  at  Uo,  the  differential  matrix  of  G  must 
be 

G'(Uo)  =  [F'(Xo)]-1, 


so  to  show  that  G  is  differentiable  at  Uo,  we  must  show  that  if 

H(u)  =  G(U)-G(Uo)-[r(x0)]-Hu-Uo)  (U^Uo)_ 


then 


|U-Uo| 


lim  H(U)  =  0. 


(6.3.26) 


(6.3.27) 


Since  F  is  one-to-one  on  5  and  F(G(U))  =  U,  it  follows  that  if  U  ^  U0,  then  G(U)  ^ 
G(Uo).  Therefore,  we  can  multiply  the  numerator  and  denominator  of  (6.3.26)  by  |G(U)  — 
G(U0)|  to  obtain 


H(U)  = 


G(U)  -  G(U0|  / G(U)  -  G(U0)  -  [F'(Xo)]-1  (U  -  Uo)' 


|U-Uo| 
|G(U)-G(U0 


[F'(Xo)]"1  (- 


G(U)-G(U0)|  J 

-Uo-F'(X0)(G(U)-G(Uo))' 


|U-Uo|  V  |G(U)-G(U0)| 

if  0  <  |U  —  Uo|  <  S.  Because  of  (6.3.25),  this  implies  that 


|H(U)|  <  IllfF'CXo)]-1! 
ifO  <  |U-U0|  <  8.  Now  let 


U  -  Uo  -  F'(X0)(G(U)  -  G(U0)) 


|G(U)-G(U0)| 


Hi(U)  = 


U-Uo-F(X0)(G(U)-G(Uq)) 
|G(U)-G(U0)| 

To  complete  the  proof  of  (6.3.27),  we  must  show  that 

lim  Hi(U)  =  0. 

Since  F  is  differentiable  at  Xo,  we  know  that  if 

F(X)-F(Xo)-F'(X0)(X-Xo) 


(6.3.28) 


H2(X)  =  lim 

x^x0 


IX -Xo 


then 


lim  H2(X)  =  0. 

x^x0 


(6.3.29) 


Since  F(G(U))  =  U  and  X0  =  G(U0), 

Hi(U)  =  H2(G(U)). 
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Now  suppose  that  e  >  0.  From  (6.3.29),  there  is  a  5i  >  0  such  that 

|H2(X)|<e    if    0  <  |X-X0|  =  |X-G(U0)|  <  8t.  (6.3.30) 
Since  G  is  continuous  at  Uo,  there  is  a  <52  e  (0,  S)  such  that 

|G(U)-G(U0)|  <  Si    if    0  <  |U-Uo|  <  <52- 
This  and  (6.3.30)  imply  that 

|Hi(U)|  =  |H2(G(U))|  <e    if  0<|U-U0|<52. 

Since  this  implies  (6.3.28),  G  is  differentiable  at  Xo. 

Since  Uo  is  an  arbitrary  member  of  F(N),  we  can  now  drop  the  zero  subscript  and 
conclude  that  G  is  continuous  and  differentiable  on  F(N),  and 

G(U)  =  [F'(X)]-1.  UeF(JV). 

To  see  that  G  is  continuously  differentiable  on  F(N),  we  observe  that  by  Theorem  6.1.14, 
each  entry  of  G'(U)  (that  is,  each  partial  derivative  dgj(XJ)/du j ,  1  <  i,  j  <  n)  can  be 
written  as  the  ratio,  with  nonzero  denominator,  of  determinants  with  entries  of  the  form 

df'fW».  (6.3.31) 

dxs 

Since  dfr/dxs  is  continuous  on  N  and  G  is  continuous  on  F(N),  Theorem  5.2.10  implies 
that  (6.3.31)  is  continuous  on  F(N).  Since  a  determinant  is  a  continuous  function  of  its 
entries,  it  now  follows  that  the  entries  of  G(U)  are  continuous  on  F(N).  H 


Branches  of  the  Inverse 

If  F  is  regular  on  an  open  set  S,  we  say  that  F^1  is  a  branch  of  F_1.  (This  is  a  convenient 
terminology  but  is  not  meant  to  imply  that  F  actually  has  an  inverse.)  From  this  definition, 
it  is  possible  to  define  a  branch  of  F_1  on  a  set  T  C  R(F)  if  and  only  if  T  =  F(S),  where 
F  is  regular  on  S.  There  may  be  open  subsets  of  R(F)  that  do  not  have  this  property,  and 
therefore  no  branch  of  F_1  can  be  defined  on  them.  It  is  also  possible  that  T  =  F(Si)  = 
F(S2),  where  Si  and  S2  are  distinct  subsets  of  In  this  case,  more  than  one  branch  of 
F_1  is  defined  on  T.  Thus,  we  saw  in  Example  6.3.4  that  two  branches  of  F_1  may  be 
defined  on  a  set  T.  In  Example  6.3.5  infinitely  many  branches  of  F-1  are  defined  on  the 
same  set. 

It  is  useful  to  define  branches  of  the  argument  To  do  this,  we  think  of  the  relationship 
between  polar  and  rectangular  coordinates  in  terms  of  the  transformation 


x 
v 


F(r,  9)  = 


r  cos  ( 
r  sin  t 


(6.3.32) 


where  for  the  moment  we  regard  r  and  6  as  rectangular  coordinates  of  a  point  in  an  r6- 
plane.  Let  5  be  an  open  subset  of  the  right  half  of  this  plane  (that  is,  S  C  {(r,  9)  |  r  >  0}) 
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that  does  not  contain  any  pair  of  points  (r,  9)  and  (r,  9  +  2kn),  where  k  is  a  nonzero  integer. 
Then  F  is  one-to-one  and  continuously  differentiable  on  S,  with 


F'(r,  9) 


cos  ( 
sin  ( 


-r  sin  ( 

r  cos  ( 


and 


JF(r,  9)  =  r  >  0,    (r,  9)  e  S. 


(6.3.33) 


(6.3.34) 


Hence,  F  is  regular  on  S.  Now  let  T  =  F(S),  the  set  of  points  in  the  xy-plane  with 
polar  coordinates  in  S.  Theorem  6.3.3  states  that  T  is  open  and  Fs  has  a  continuously 
differentiable  inverse  (which  we  denote  by  G,  rather  than  F^ 1 ,  for  typographical  reasons) 


=  G(x,y)  = 


(x,y)eT, 


y/x2  +  y2 
argsO,.y) 

where  args(x,  y)  is  the  unique  value  of  arg(x,  y)  such  that 

(r,9)  =  (T^TT1.  args(x,y))  e  S. 

We  say  that  args(x,  y)  is  a  branch  of  the  argument  defined  on  T .  Theorem  6.3.3  also 
implies  that 


G'(x,y)=[F'(r,9)]  '  = 


COS  I 

sin  t 


sin  ( 

cos  6 


(see  (6.3.33)) 


y 


,Jx2  +  y2     ,Jx2  +  y2 
y  x 


x2  +  y' 


x2  +  y' 


(see  (6.3.32)). 


Therefore, 


daigs(x,y) 
dx 


x1  +  y1 


3arg5(%,y) 
3y 


xz  +  y1 


(6.3.35) 


A  branch  of  arg(x,  y)  can  be  defined  on  an  open  set  T  of  the  x_y-plane  if  and  only  if 
the  polar  coordinates  of  the  points  in  T  form  an  open  subset  of  the  r9  -plane  that  does  not 
intersect  the  0-axis  or  contain  any  two  points  of  the  form  (r,  9)  and  (r,  9  +  2kn),  where 
k  is  a  nonzero  integer.  No  subset  containing  the  origin  (x,  y  )  =  (0,  0)  has  this  property, 
nor  does  any  deleted  neighborhood  of  the  origin  (Exercise  6.3.14),  so  there  are  open  sets 
on  which  no  branch  of  the  argument  can  be  defined.  However,  if  one  branch  can  be  defined 
on  T,  then  so  can  infinitely  many  others.  (Why?)  All  branches  of  arg(*,  y)  have  the  same 
partial  derivatives,  given  in  (6.3.35). 
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Example  6.3.7  The  set 

T  =  {(x,  y)  |  (x,  y)  ^  (x,  0)    with   x  >  0} , 

which  is  the  entire  xy-p\ane  with  the  nonnegative  x-axis  deleted,  can  be  written  as  T  = 
F(Sk),  where  F  is  as  in  (6.3.32),  k  is  an  integer,  and 

Sk  =  {(r,  9)  |  r  >  0,  2kir  <  6  <  2(k  +  l)jt} . 

For  each  integer  k,  we  can  define  a  branch  arg^  (x,  y)  of  the  argument  in  Sk  by  taking 
argSfr  (x,  y)  to  be  the  value  of  arg(x,  y)  that  satisfies 

2kn  <  zrgSk(x,y)  <  2(k  +  1)jt. 

Each  of  these  branches  is  continuously  differentiable  in  T ,  with  derivatives  as  given  in 
(6.3.35),  and 

MgSk(x,  y)  ~  argS/.(jc,  y)  =  2(k  -  j)n,    (x,  y)  e  T. 
Example  6.3.8  Returning  to  the  transformation 

=  F(x,y)  = 


u 

v 


2  2 

x  —  y 


2xy 


we  now  see  from  Example  6.3.4  that  a  branch  G  of  F  1  can  be  denned  on  any  subset  T  of 
the  uv -plane  on  which  a  branch  of  arg(w,  v)  can  be  defined,  and  G  has  the  form 


A' 

y 


G(u,v)  = 


(u2  +  v2)1/4  cos(arg(w,  i>)/2) 
(u2  +  v2)1/4  sin(arg(w,  v)/2) 


(u,  v)  €  T,  (6.3.36) 


where  arg(w,  v)  is  a  branch  of  the  argument  defined  on  T.  If  Gi  and  G2  are  different 
branches  of  F-1  defined  on  the  same  set  T,  then  Gi  =  ±G2.  (Why?) 

From  Theorem  6.3.3, 


G'(u,v)  =  [F'(x,y)]  '  = 


2x  —2y 
2y  2x 

1 


-1-1 


x  y 

-y  x 


2(x2  +  y2) 

Substituting  for  x  and  y  in  terms  of  u  and  v  from  (6.3.36),  we  find  that 


and 


dx 

du 

dx 

9 1) 


dy 
3v 


1 


du 


2(x2  +  y2)      2(u2  +  v2yl4 

y  1 


cos(arg(w,  v)/2)  (6.3.37) 
sin(arg(M,  v)/2).  (6.3.38) 


2{x2  +  y2)      2(u2  +  v2)1!4 
It  is  essential  that  the  same  branch  of  the  argument  be  used  here  and  in  (6.3.36) 
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We  leave  it  to  you  (Exercise  6.3.16)  to  verify  that  (6.3.37)  and  (6.3.38)  can  also  be 
obtained  by  differentiating  (6.3.36)  directly. 


Example  6.3.9  If 


=  F(x,y)  = 


e  cos  y 
ex  siny 


(Example  6.3.5),  we  can  also  define  a  branch  G  of  F  1  on  any  subset  T  of  the  wu-plane  on 
which  a  branch  of  arg(w,  v)  can  be  defined,  and  G  has  the  form 


x 

y 


=  G(u,  v)  = 


log(w2  +  i>2)1/2 
arg(w,  i>) 


(6.3.39) 


Since  the  branches  of  the  argument  differ  by  integral  multiples  of  2n,  (6.3.39)  implies  that 
if  Gi  and  G2  are  branches  of  F_1,  both  defined  on  T ,  then 


Gi(m,  v)  —  G2(u,  v) 
From  Theorem  6.3.3, 

G'(u,v)  =  [F'(x,y)Y1 


0 

2k  71 


(k  =  integer). 


-1-1 


e  cos  y  —  e  siny 
ex  smy  excos>' 

e~xcosy  e_xsin>' 
—e~xsmy    e~x  cosy 


Substituting  for  x  and  y  in  terms  of  u  and  v  from  (6.3.39),  we  find  that 


dx  by 
du  dv 


— r  — It  u 

=  e     cos  y  =  e      u  =  —  - 

uz  +  vz 


and 


dx        3y       _r  _7r  v 

—  =  -—  =e  xsmy  =  e  zxv  = 

dv        du  uz  +  vz 


The  Inverse  Function  Theorem 

Examples  6.3.4  and  6.3.5  show  that  a  continuously  differentiable  function  F  may  fail  to 
have  an  inverse  on  a  set  S  even  if  /F(X)  7^  0  on  5.  However,  the  next  theorem  shows  that 
in  this  case  F  is  locally  invertible  on  S. 

Theorem  6.3.4  (The  Inverse  Function  Theorem)  Let  F  :  W  ->•  W  be 

continuously  differentiable  on  an  open  set  S,  and  suppose  that  /F(X)  7^  0  on  S.  Then,  if 
Xo  €  S,  there  is  an  open  neighborhood  N  0/X0  on  which  F  is  regular.  Moreover,  ¥(N) 
is  open  and  G  =  F^  is  continuously  differentiable  on  ¥(N),  with 


G'(U)  =  [F'(X)]  1      (where  U  =  F(X)),    U  €  ¥(N). 
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Proof  Lemma  6.2.6  implies  that  there  is  an  open  neighborhood  N  of  Xo  on  which  F  is 
one-to-one.  The  rest  of  the  conclusions  then  follow  from  applying  Theorem  6.3.3  to  F  on 
N.  '  U 

Corollary  6.3.5  If 'F  is  continuously  differentiable  on  a  neighborhood  of 'Xo  and /F(Xo)  ^ 
0,  then  there  is  an  open  neighborhood  N  o/Xo  on  which  the  conclusions  of  Theorem  6.3.4 
hold. 

Proof  By  continuity,  since  /F'(Xo)  7^  0,  /F'(X)  is  nonzero  for  all  X  in  some  open 
neighborhood  5  of  Xq.  Now  apply  Theorem  6.3.4.  H 


Example  6.3.10  Let  X0  =  (1, 2, 1)  and 

--F(x,y,z)-- 


u 

V 

w 


Then 


F'(x,y,z)  = 


1 

2x  —  2 
1 


x  +  y  +  (z  ■ 
y  +  z,  +  (x 
Z.  +  x  +  (y  ■ 

1 
1 

2y-4 


1)  2  +  1 
•  l)2-  1 

2)  2  +  3 


2z  -2 
1 
1 


/F(Xo)  = 


=  2. 


In  this  case,  it  is  difficult  to  describe  N  or  find  G  =  F^1  explicitly;  however,  we  know  that 
F(A0  is  a  neighborhood  of  U0  =  F(X0)  =  (4,  2,  5),  that  G(U0)  =  X0  =  (1,2,  1),  and 
that 


G'(Uo)  =  [F'(Xo)]  1  = 


Therefore, 


G(U) 


"  1  n 

1 

2 

1 

+  2 

1 
1 

-1 


-1 
1 
1 


-1 
1 
1 


1 

-1 
1 


u  —  4 
v-2 
w  —  5 


where 


lim 


E(U) 


+  E(U), 


0; 


u-K4,2,s)  y/(u  -  4)2  +  (v-  2)2  +  (10  -  5)2 
thus  we  have  approximated  G  near  Uo  =  (4,  2,  5)  by  an  affine  transformation.  ■ 

Theorem  6.3.4  and  (6.3.34)  imply  that  the  transformation  (6.3.32)  is  locally  invertible 
on  S  =  {(r,  6)  \  r  >  0},  which  means  that  it  is  possible  to  define  a  branch  of  arg(x,  y)  in  a 
neighborhood  of  any  point  (xq,  yo)  ^  (0,  0).  It  also  implies,  as  we  have  already  seen,  that 
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the  transformation  (6.3.7)  of  Example  6.3.4  is  locally  invertible  everywhere  except  at  (0,  0), 
where  its  Jacobian  equals  zero,  and  the  transformation  (6.3.16)  of  Example  6.3.5  is  locally 
invertible  everywhere. 

6.3  Exercises 


1.  Prove:  If  F  is  invertible,  then  F_1  is  unique. 

2 .  Prove  Theorem  6 . 3 . 1 . 

3.  Prove:  The  linear  transformation  L(X)  =  AX  cannot  be  one-to-one  on  any  open  set 
if  A  is  singular.  Hint:  Use  Theorem  6.1.15. 

4.  Let 


G(x,y)  = 


r 


n/2  <  arg(x,  y)  <  5n/2. 


arg(x,y) 
Find 

(a)  G(0, 1)  (b)  G(1,0)  (c)  G(-l,  0) 

(d)G(2,2)  (e)G(-l,l) 

5.  Same  as  Exercise  6.3.4,  except  that  —  2tt  <  arg(x,  y)  <  0. 

6.  (a)    Prove:  If  /  :  K  — >  K  is  continuous  and  locally  invertible  on  (a,  b),  then  /  is 

invertible  on  (a,b). 

(b)  Give  an  example  showing  that  the  continuity  assumption  is  needed  in  (a). 

7.  Let 

F(*,j)  = 


2  2 

x  —  y 


2xy 

(Example  6.3.4)  and 

S  =  {(x,y)  \ax  +by  >  0}    (a2  +  b2  ^  0). 


FindF(S)  and  F"1.  If 


Si  =  {(x,  y)  |  ax  +  by  <  0}  , 


show  that  F(5i )  =  F(S)  and  F^1  =  -F^1 . 
8.     Show  that  the  transformation 


=  F(x,y)  = 


e  cos  y 
ex  sin  j 


(Example  6.3.5)  is  one-to-one  on  any  set  S  that  does  not  contain  any  pair  of  points 
(xo,  yo)  and  (xo,  yo  +  2kn),  where  k  is  a  nonzero  integer. 
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9.     Suppose  that  F 


10. 


11. 


12. 


13. 


14. 


is  continuous  and  invertible  on  a  compact  set  S.  Show 


that  Fs  is  continuous.  Hint:  IfFs  is  not  continuous  at  U  in  F(S),  then  there  is 
an  €q  >  0  and  a  sequence  {U^}  in  F(5)  such  that  lim^oo  U/t  =  U  w/z/Ze 

|F/(U^)-F/(U)|  >€0,  fc>l. 

[Ave  Exercise  5.1.32  to  obtain  a  contradiction. 
FindF"1  and  (F"1)': 

4x  +  2y 
-3x+  y 


(a) 
(b) 


F(jc,y) 


M  —  X  +  J  +  2z 

v  =  F(x,  y,Z)=  3x  +  y  —  4z, 
w  —x  —  y  +  2z 

In  addition  to  the  assumptions  of  Theorem  6.3.3,  suppose  that  all  gth-order  (q  >  1) 
partial  derivatives  of  the  components  of  F  are  continuous  on  S.  Show  that  all  gth- 
order  partial  derivatives  of  F^1  are  continuous  on  F(S). 

If 

u  1     ^      „      f  x2  +  y2 


r 


F(x,  y)  = 

(Example  6.3.1),  find  four  branches  Gi,  G2,  G3,  and  G4  of  F_1  defined  on 

T\  =  {(w,  v)  J  u  +  v  >  0,  u  —  v  >  0}  , 

and  verify  that  G-(w,  v)  =  (F'(x(u,  v),  y{u,  v)))~l ,  1  <  i  <  4. 
Suppose  that  A  is  a  nonsingular  n  x  n  matrix  and 

v2 


U  =  F(X)  =  A 


(a) 


Show  that  F  is  regular  on  the  set 

S  =  {X\eiXt  >  0,  1  <  i  <  11} , 

where  e,-  =  ±1, 1  <  i  <  n. 
(b)   FindF^HU).    (c)  Find  (F"1)'^). 
Let  9(x,  y)  be  a  branch  of  arg(x,  y)  defined  on  an  open  set  S. 

(a)  Show  that  9(x,  y)  cannot  assume  a  local  extreme  value  at  any  point  of  5. 

(b)  Prove:  If  a  7^  0  and  the  line  segment  from  (xq,  yo)  to  (axo,  ayo)  is  in  S,  then 
6(ax0,ay0)  =  6(x0,y0). 

(c)  Show  that  5  cannot  contain  a  subset  of  the  form 


A  = 


(x,  y)  I  0  <  r\  <  y/x2  +  y2  <  r2| . 
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(d)    Show  that  no  branch  of  arg(x,  y)  can  be  defined  on  a  deleted  neighborhood 
of  the  origin. 

15.     Obtain  Eqn.  (6.3.35)  formally  by  differentiating: 


(a)  arg(x,>>)  =  cos  1 


(b)  arg{x,  y)  =  sin  1 


(c)  arg(x,  y)  =  tan  1  — 

Where  do  these  formulas  come  from?  What  is  the  disadvantage  of  using  any  one  of 
them  to  define  arg(jc,  y)l 


16.     For  the  transformation 


=  F(x,y)  = 


2  2 

x  —  y 
2xy 


(Example  6.3.4),  find  a  branch  G  of  F_1  defined  on  T  =  {(m,  v)  |  au  +  bv  >  0}. 
Find  G'  by  means  of  the  formula  G'(U)  =  [F'(X)]_1  of  Theorem  6.3.3,  and  also  by 
direct  differentiation  with  respect  to  u  and  v. 


17.     A  transformation 


u{x,y) 
v(x,y) 


F(x,y)  = 

is  analytic  on  a  set  S  if  it  is  continuously  differentiable  and 

ux  =  vy,    uy  =  -vx 

on  S.  Prove:  If  F  is  analytic  and  regular  on  5,  then  F^1  is  analytic  on  F(5);  that  is, 
xu  =  uv  and  xv  =  -uu. 

18.  Prove:  If  U  =  F(X)  and  X  =  G(U)  are  inverse  functions,  then 

3(mi,  u2,  ...,u„)  d(xi,x2,  ...,X„) 
d(xi,x2,...,xn  9(«i,  u2, . . . ,  un) 

Where  should  the  Jacobians  be  evaluated? 

19.  Give  an  example  of  a  transformation  F  :  W      W  that  is  invertible  but  not  regular 


20.     Find  an  affine  transformation  A  that  so  well  approximates  the  branch  G  of  F  1 
defined  near  Uo  =  F(Xo)  that 

G(U)-A(U) 
lim  — — — —  =  0. 


(a) 


F(jc,j) 


u^u0  |U-U0| 

x4y5  —  Ax 
x3y2  — 3y 


X0  =  (1,-1) 
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(b) 
(c) 

(d) 


u 

V 

u 

V 

w 
u 

V 

w 


=  F(x,y)  = 


x  y  +  xy 
2xy  +  xy2 


X0  =  (1,  1) 


F(x,y,z)  = 


2x2y  +  x3  +  z. 
x3  +  yz, 
x  +  y  +  z. 

x  cos  y  cos  z. 
x  sin  y  cos  z 
x  sinz 


X  =  (0,  1,  1) 


X0  =  (l,jr/2,jr) 


21.     IfF  is  denned  by 


x 

y 


=  F(r,  9,  <p)  = 


r  cos  9  cos  <p 
r  sin  9  cos  (p 
r  sin  (6 


and  G  is  a  branch  of  F  ,  find  G'  in  terms  of  r,  9,  and  (p.  Hint:  See  Exer- 
cise 6.2.14(6). 


22.     IfF  is  defined  by 


x 

y 
z 


=  F(r,  9,  z)  = 


r  cos  i 
r  sin  ( 

Z 


and  G  is  a  branch  of  F  l,  find  G  in  terms  of  r,  9,  and  z.  Hint:  See  Exer- 
cise 6.2.14(c). 

23.  Suppose  that  F  :  W  — »  W  is  regular  on  a  compact  set  7".  Show  that  F(3T)  = 
3F(7");  that  is,  boundary  points  map  to  boundary  points.  Hint:  Use  Exercise  6.2.23 
and  Theorem  6.3.3  to  show  that  3F(7")  C  F(37").  Then  apply  this  result  with  F  and 
T  replaced  by  ¥~l  and¥(T)  to  show  that  F(9T)  C  3F(T). 


6.4  THE  IMPLICIT  FUNCTION  THEOREM 

In  this  section  we  consider  transformations  from  W+m  to  M.m.  It  will  be  convenient  to 
denote  points  in  Rn+m  by 

(X,  U)  =  (xi,x2, . .. ,  xn,  u\,  u2,...,  um). 

We  will  often  denote  the  components  of  X  by  x,  y,  . . . ,  and  the  components  of  U  by  u,  v, 

To  motivate  the  problem  we  are  interested  in,  we  first  ask  whether  the  linear  system  of 
m  equations  in  m  +  n  variables 

anxi  +  a\2X2  +  ■  ■  ■  +  a\nxn  +  b\\U\  +  b\2u2  +  ■  ■  ■  +  b\mum  =  0 
a2i*i  +  a22X2  +  ■■■  +  a2nxn  +  b2\uY  +  b22ux  +  ■■■  +  b2mum  =0 

.  (6.4.1) 

amixi  +  am2x2  +  ■■■  +  amnxn  +  bm\U\  +  bm2u2  +  h  bmmum  =  0 
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determines  u\,  M2,  . . . ,  um  uniquely  in  terms  of  Xi,  x%,  xn.  By  rewriting  the  system  in 
matrix  form  as 

AX  +  BU  =  0, 


where 


an  Ai2 

0-m\  am2 


Q-2n 


B 


b\\  b\2 
bj\  b22 

bml  bm2 


b\m 
b2m 


X  = 


X2 


and    U  = 


"1 

u2 


we  see  that  (6.4.1)  can  be  solved  uniquely  for  U  in  terms  of  X  if  the  square  matrix  B  is 
nonsingular.  In  this  case  the  solution  is 

U  =  -B_1AX. 

For  our  purposes  it  is  convenient  to  restate  this:  If 

F(X,  U)  =  AX  +  BU,  (6.4.2) 
where  B  is  nonsingular,  then  the  system 

F(X,  U)  =  0 

determines  U  as  a  function  of  X,  for  all  X  in  R" . 

Notice  that  F  in  (6.4.2)  is  a  linear  transformation.  If  F  is  a  more  general  transformation 
from  M"+m  to  Rm,  we  can  still  ask  whether  the  system 


F(X,  U)  =  0, 


or,  in  terms  of  components, 


fl(Xi,X2,  Ml,  1*2.  ••■  >  um)  =  0 

fz(Xl,X2,  .  .  .  ,Xn,Ul,U%,  .  .  .  ,  Um)  =  0 

fm{x\,X2,  ...,Xn,Ul,Uz,...,Um)=0, 

can  be  solved  for  U  in  terms  of  X.  However,  the  situation  is  now  more  complicated,  even 
if  m  =  1 .  For  example,  suppose  that  m  =  1  and 


f{x,  y,u)  =  1  —  x2  —  y2  —  u2. 
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If  x2  +  y2  >  1,  then  no  value  of  u  satisfies 

f(x,y,u)  =  0.  (6.4.3) 
However,  infinitely  many  functions  u  =  u(x,  y)  satisfy  (6.4.3)  on  the  set 

S  =  {(x,y)\x2  +  y2  <  1}. 

They  are  of  the  form 

u(x,  y)  =  e(x,  y)\/\  —  x2  —  y2, 

where  e(x,  y)  can  be  chosen  arbitrarily,  for  each  (x,  y)  in  S,  to  be  1  or  — 1.  We  can  narrow 
the  choice  of  functions  to  two  by  requiring  that  u  be  continuous  on  S ;  then 

u(x,  y)  =  y/l  -x2  -y2  (6.4.4) 

or 

u(x,y)  =  —  yl  —  x2  —  y2. 

4.3)  by  specifying  its  value  at  a  single 

1 

7!' 

F(X,  U)  =  0 

determines  U  as  a  function  of  X  is  too  general  to  have  a  useful  answer.  However,  there 
is  a  theorem,  the  implicit  function  theorem,  that  answers  this  question  affirmatively  in 
an  important  special  case.  To  facilitate  the  statement  of  this  theorem,  we  partition  the 
differential  matrix  of  F  :  R"+m  -+  Rm: 


-  ¥i_ 

Ml 

Ml 

M 

Ml 

Mi 

dx\ 

dx2 

3x„ 

dum 

dh 

Ml 

Mi 

Mi_ 

Mi 

Ml 

dx\ 

3X2 

3x„ 

du2 

dum 

Mm 

Mm 

Mm 

Mm 

Mm 

Mm 

-  dx\ 

3x2 

3x„ 

du\ 

du2 

dum 

or 

F'=  [Fx,Fu], 

where  Fx  is  the  submatrix  to  the  left  of  the  dashed  line  in  (6.4.5)  and  Fu  is  to  the  right. 

For  the  linear  transformation  (6.4.2),  Fx  =  A  and  Fy  =  B,  and  we  have  seen  that  the 
system  F(X,  U)  =  0  defines  U  as  a  function  of  X  for  all  X  in  R"  if  Fy  is  nonsingular.  The 
next  theorem  shows  that  a  related  result  holds  for  more  general  transformations. 


We  can  define  a  unique  continuous  solution  u  of  (6 
interior  point  of  5 .  For  example,  if  we  require  that 

M(7r7i)  = 

then  u  must  be  as  defined  by  (6.4.4). 

The  question  of  whether  an  arbitrary  system 
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Theorem  6.4.1  (The  Implicit  Function  Theorem)  Suppose thatF  :Rn+m 
M.m  is  continuously  differentiable  on  an  open  set  S  of  W+m  containing  (Xo,Uo).  Let 
F(Xo,  Uo)  =  0,  and  suppose  that  Fu(Xo,  Uo)  is  nonsingular.  Then  there  is  a  neighborhood 
M  of  (Xo,  Uo),  contained  in  S,  on  which  Fu(X,  U)  is  nonsingular  and  a  neighborhood  N 
o/Xo  in  W  on  which  a  unique  continuously  differentiable  transformation  G  :  W  — >  M.m 
is  defined,  such  that  G(Xq)  =  Uo  and 


(X,  G(X))  e  M    and    F(X,  G(X))  =  0    if  XeN. 


Moreover, 


G'(X)  =  -[Fu(X,  G(X))]-1FX(X,  G(X)),    X  e  N. 
Proof   Define  $  :  R"+m  ->  M"+m  by 


x2 


x« 

*(X,U)=  /!(X,U) 
/2(X,U) 

L  /m(X,U)  J 
or,  in  "horizontal"notation  by 

*(X,U)  =  (X,F(X,U)). 
Then  <I>  is  continuously  differentiable  on  S  and,  since  F(Xo ,  Uo)  =  0, 

4>(X0,Uo)  =  (X0,0). 


The  differential  matrix  of  $ 


is 


<t>' 


1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0  •■ 

0 

0 

0  • 

1 

0 

0 

0 

dA 

Ml 

3/i 

dx\ 

3x2 

3x„ 

3wi 

3M2 

3wm 

Sh 

dh 

Sh 

^2 

3/2 

dx\ 

3x2 

3x„ 

du\ 

3M2 

3wm 

Mm 

dfm 

Vm 

dfm 

<l/m 

3/m 

3X2 

3x„ 

3«2 

dum 

I 

Fx 


(6.4.6) 
(6.4.7) 


(6.4.8) 


(6.4.9) 
(6.4.10) 


0 

Fu 
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where  I  is  the  n  x  n  identity  matrix,  0  is  the  n  x  m  matrix  with  all  zero  entries,  and  Fx 
and  Fy  are  as  in  (6.4.5).  By  expanding  det(<I>')  and  the  determinants  that  evolve  from  it  in 
terms  of  the  cofactors  of  their  first  rows,  it  can  be  shown  in  n  steps  that 


=  det(fc') 


M 

Ml 

Mi 

du2 

dum 

M. 

M 

3/2 

3mi 

du2 

dum 

Mm 

Mm 

Mm 

dui 

du2 

dum 

=  det(Fu). 


In  particular, 

/•(Xo.Uo)  =  detCFuCXo.Uo)  ^  0. 
Since  $  is  continuously  differentiable  on  S,  Corollary  6.3.5  implies  that  $  is  regular  on 
some  open  neighborhood  M  of  (Xo,  Uo)  and  that  M  =  is  open. 

Because  of  the  form  of  $  (see  (6.4.8)  or  (6.4.9)),  we  can  write  points  of  M  as  (X,  V), 
where  V  e  Rm.  Corollary  6.3.5  also  implies  that  $  has  a  a  continuously  differentiable 
inverse  T(X,  V)  defined  on  M  with  values  in  M.  Since  $  leaves  the  "X  part"  of  (X,  U) 
fixed,  a  local  inverse  of  $  must  also  have  this  property.  Therefore,  T  must  have  the  form 


T(X.  V) 


x2 


MX,  v) 
MX,  V) 

L  A»(X,V)  J 


or,  in  "horizontal"  notation, 

r(x,  v)  =  (x,H(x,  v)), 

where  H  :  M"+m  -+  Rm  is  continuously  differentiable  on  M.  We  will  show  that  G(X) 
H ( X  0)  has  the  stated  properties. 

From  (6.4.10),  (Xo,  0)  €  M  and,  since  M  is  open,  there  is  a  neighborhood  N  of  Xo  in 
R"  such  that  (X,  0)  €  M  if  X  e  N  (Exercise  6.4.2).  Therefore,  (X,  G(X))  =  T  (X,  0)  e  Af 
if  X  e  AT.  Since  T  =  (X,0)  =  $(X,G(X)).  Setting  X  =  X0  and  recalling  (6.4.10) 
shows  that  G(X0)  =  Uo,  since  $  is  one-to-one  on  M. 
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Henceforth  we  assume  that  X  e  N .  Now, 

(X,  0)  =  $<T(X,  0))  (since  $  =  T-1) 

=  $  (X,  G(X))  (since  r  (X,  0)  =  (X,  G(X))) 

=  (X,  F(X,  G(X)))     (since  $(X,  U)  =  (X,  F(X,  U))). 

Therefore,  F(X,  G(X))  =  0;  that  is,  G  satisfies  (6.4.6).  To  see  that  G  is  unique,  suppose 
that  Gi  :  W  -+  W"  also  satisfies  (6.4.6).  Then 

$(X,G(X))  =  (X,  F(X,  G(X)))  =  (X,0) 

and 

*(X,Gi(X))  =  (X,F(X,Gi(X)))  =  (X,0) 
for  all  X'mN.  Since  $  is  one-to-one  on  M,  this  implies  that  G(X)  =  Gi  (X). 
Since  the  partial  derivatives 

Shi 

t, — ,     1  <  i  <  m,     1  <  j  <  n, 

dxj 

are  continuous  functions  of  (X.  V)  on  M,  they  are  continuous  with  respect  to  X  on  the 
subset  {(X,  0)  |  X  e  N}  of  M .  Therefore,  G  is  continuously  differentiable  on  N .  To  verify 
(6.4.7),  we  write  F(X,  G(X))  =  0  in  terms  of  components;  thus, 

fi(x1,x2,...,xn,g1(X),g2(X),...,gm(X))  =  0,     l<i<m,  X&N. 

Since  /;  and  g\,  g2,  gm  are  continuously  differentiable  on  their  respective  domains, 
the  chain  rule  (Theorem  5.4.3)  implies  that 

97HX,G(X))  ^3/-(X,G(X))3gr(X) 

 n  H  /   t,  t,  =0,     1  <  i  <  m,  1  <  j  <  n,  (6.4.11) 

ox;  '        dMr  ox; 

or,  in  matrix  form, 

FX(X,  G(X))  +  Fu(X,  G(X))G'(X)  =  0.  (6.4.12) 

Since  (X,  G(X))  e  M  for  all  X  in  and  FD(X,  U)  is  nonsingular  when  (X,  U)  e  M,  we 
can  multiply  (6.4.12)  on  the  left  by  F^j1^,  G(X))  to  obtain  (6.4.7).  This  completes  the 
proof.  TJ 

In  Theorem  6.4.1  we  denoted  the  implicitly  defined  transformation  by  G  for  reasons 
of  clarity  in  the  proof.  However,  in  applying  the  theorem  it  is  convenient  to  denote  the 
transformation  more  informally  by  U  =  U(X);  thus,  U(Xo)  =  Uo,  and  we  replace  (6.4.6) 
and  (6.4.7)  by 

(X,  U(X))  €  M    and     X(X,  U(X))  =0    if    X  €  N, 

and 

U'(X)  =  -[Fu(X,  U(X))]"1Fx(X,  U(X)),    X  e  N, 
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while  (6.4.11)  becomes 

dfi  v— v  dfi  dur 

-^  +  V-^-^=0,  l<i  <m,  l<j  <n,  (6.4.13) 
dxi  *— '  di/r  ox; 

r  =  l  J 

it  being  understood  that  the  partial  derivatives  of  wr  and  //  are  evaluated  at  X  and  (X,  U(X)), 
respectively. 

The  following  corollary  is  the  implicit  function  theorem  for  m  —  1 . 

Corollary  6.4.2  Suppose  that  f  :  R"+1  — >  R  is  continuously  differentiable  on  an 
open  set  containing  (Xo,Wo),  with  /(Xo,Wo)  —  0  and  /M(Xo,Mo)  7^  0.  Then  there  is  a 
neighborhood  M  o/(Xo ,  Mo),  contained  in  S,  and  a  neighborhood  N  0/X0  in  W  on  which 
is  defined  a  unique  continuously  differentiable  function  u  =  m(X)  :  W  — >  R  such  that 

(X,  w(X))  e  M    awd     /„(X,  w(X))  7^  0,    X  e  N, 

u(Xq)  =  M0,    and    /(X,  m(X))  =  0,  XeJV. 

77ie  partial  derivatives  ofu  are  given  by 

A,.(x,M(X)) 

w   (X)  =  —  ,     1  <  (  <  n. 

/M(X,u(X))         -  - 

Example  6.4.1  Let 

f(x,  y,  u)  =  1  —  x2  —  y2  —  u2 
and  (x0,  y0,  u0)  =  (±,  -±,  -±=).  Then  /(xo,  yo,  Zo)  =  0  and 

fx(x,  y,  u)  =  —2x,     fy(x,  y,  u)  =  -2y,     fu(x,  y,  u)  =  -2u. 

Since  /  is  continuously  differentiable  everywhere  and  fu(xo,  yo,  «o)  =  — v2  ^  0,  Corol- 
lary 6.4.2  implies  that  the  conditions 

1  —  x2  —  y2  —  u2  =  0,     w(l/2,-l/2)  =  4=> 

V2 

determine  1/  =  m(x,  j)  near  (x0.  yo)  =  (5,  —A)  so  that 

"x(^j)  =  - ^7  -;   =  —  r,  (6.4.14) 

fu(x,y,u(x,y))  u(x,y) 

and 

Uy{x,  y)  =  — —   =  —  -.  (6.4.15) 

fu(x,y,u(x,y))  u(x,y) 


It  is  not  necessary  to  memorize  formulas  like  (6.4.14)  and  (6.4.15).  Since  we  know  that 
/  and  u  are  differentiable,  we  can  obtain  (6.4.14)  and  (6.4.15)  by  applying  the  chain  rule 
to  the  identity 

f(x,y,u(x,y))  =  0. 
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Example  6.4.2  Let 

f(x,  y,  u)  =  x3y2u2  +  3xy4u4  -  3x6y6u7  +  I2x 
and  (xo,yo,  uq)  =  (1,-1,  1),  so  f(xo,  yo,  "o)  =  0.  Then 


13 


(6.4.16) 


fx(x,y,u)  =  3xzyzuz  +  3y4u4  -  18x5jbw7  +  12, 
fy(x,  y,  u)  =  2x3yu2  +  I2xy3u4  -  Wx6y5u7, 
fu(x,  y,  u)  =  2x3y2u  +  \2xy4u3  -  2lx6y6u6. 

Since  /u(l,  —  1,  1)  =  —7  ^  0,  Corollary  6.4.2  implies  that  the  conditions 

/(jc,y,M)  =  0,    m(1,-1)  =  1 

determine  u  as  a  continuously  differentiable  function  of  (x,  y)  near  (1,-1). 


(6.4.17) 


If  we  try  to  solve  (6.4.16)  for  u,  we  see  very  clearly  that  Theorem  6.4.1  and  Corol- 
lary 6.4.2  are  existence  theorems;  that  is,  they  tell  us  that  there  is  a  function  u  =  u(x,  y) 
that  satisfies  (6.4.17),  but  not  how  to  find  it.  In  this  case  there  is  no  convenient  formula  for 
the  function,  although  its  partial  derivatives  can  be  expressed  conveniently  in  terms  of  x, 
y,  and  u{x,  y): 


ux(x,  y)  =  - 


fx(x,y,u(x,y)) 


fu(x,y,u(x,y))' 
In  particular,  since  w(l,  —  1)  =  1, 


uy(x,y)  =  - 


fy(x,y,u(x,y)) 
fu(x,y,u(x,y))' 


K*(l,-1)  = 

Example  6.4.3  Let 

X  = 


0 

^7 


x 

y 

z. 


0,  Uy{\,-\) 


and     U  = 


and 


F(X,  U) 


2x2  +  y2  +  z2  +  u2  -  v2 
x2  +  z.2  +  2u  —  v 


If  X0  =  (1,  -1,  1)  and  U0  =  (0,  2),  then  F(X0,  U0)  =  0.  Moreover, 


Fu(X,U)  = 


2m  —  2v 
2  -1 


and    Fy  = 


Ax  2y  2z 
2x     0  2z 


det(Fu(Xo,Uo)) 


0  -4 
2  -1 
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Hence,  the  conditions 


F(X,U)  =  0,    U(l,-l,l)  =  (0,2) 

determine  U  =  U(X)  near  Xo.  Although  it  is  difficult  to  find  U(X)  explicitly,  we  can 
approximate  U(X)  near  Xq  by  an  affine  transformation.  Thus,  from  (6.4.7), 


U'(Xo)  =  -[FdXo.UfXo^FxCXcUtXo)) 


(6.4.18) 


"  0 

-4  " 

-l 

"  4 

-2 

2 

2 

-1 

2 

0 

2 

-1  4 
-2  0 


-2  2 
0  2 


4  2  6 
-8    4  -4 


Therefore, 


lim 

x-Ki  -i,i) 


m(x,  j) 
u(x,j) 


+ 


4  2  6 
-8    4  -4 


x  -  1 

y  +  i 

z,  -  1 


[(*  -  l)2  +  (y  +  l)2  +  (z  -  l)2] 


1/2 


Again,  it  is  not  necessary  to  memorize  (6.4.18),  since  the  partial  derivatives  of  an  implic- 
itly defined  function  can  be  obtained  from  the  chain  rule  and  Cramer's  rule,  as  in  the  next 
example. 


Example  6.4.4  Let  u  =  u(x,  y)  and  v  =  v(x,  y)  be  differentiable  and  satisfy 

x2  +  2y2  +  3z2  +  u2  +  v  =  6 
2x3  +  4y2  +  2z2  +  u  +  v2  =  9 

and 


(6.4.19) 


u(l,-l,0)  =  -1,    u(l,-l,0)  =  2. 
To  find  ux  and  vx,  we  differentiate  (6.4.19)  with  respect  to  x  to  obtain 

2x  +  2uux  +  vx  =  0 
6x2  +  ux  +  2vvx  =  0. 

Therefore, 


(6.4.20) 


2m 

1  " 

ux 

2x 

1 

2v 

vx 

6x2 
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and  Cramer's  rule  yields 


and 


2x  1 
6x2  2v 


2u  1 
1  2v 


2u  2x 
1  6x2 


2u  1 
1  2v 


if  4uv  y£  1.  In  particular,  from  (6.4.20), 


6x  — 4xv 
Auv  —  1 


2x  —  I2x2u 
Auv  —  1 


-2      2  14  14 

Ml,-l,0)=-  =  -,     vx(h-hO)  =  -=--. 


Jacobians 


It  is  convenient  to  extend  the  notation  introduced  in  Section  6.2  for  the  Jacobian  of  a  trans- 
formation F  :  Km  — >  M.m.  If  f\,  f2,  fm  are  real-valued  functions  of  k  variables, 
k  >  m,  and  b>  £m  are  any  m  of  the  variables,  then  we  call  the  determinant 


3/i 

3/i 

Ml 

3b 

3b 

3?m 

3/2 

3/2 

^2 

3b 

3b 

3?m 

Mm 

3/m 

3b 

3b 

9£m 

the  Jacobian  of  f\,  fi, . . . ,  fm  with  respect  to  b»  b»  •  •  •>  %m-  We  denote  this  Jacobian  by 

3(./i-/>.  ./,„) 

3(b.b,....?m) ' 

and  we  denote  the  value  of  the  Jacobian  at  a  point  P  by 

3(/1;/2,...,/m) 


3(b-b 


■  bo 


Example  6.4.5  If 


F(x,j,z)  = 


3x2  +  2x_y  +  z2 
4jc2  +  2;cy2  +  z3 
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then 


and 


Hfi,fi) 

d(x,y) 


6x  +  2y  2x 
8x  +  2y2  Axy 


3(/l,/2) 


2x  2z. 
4xy  3z,2 


3(/i,/2) 


2z,  6x  +  2y 
3z2    8x  +  2y2 


d(z„x) 

The  values  of  these  Jacobians  at  Xq  =  (— 1,  1,  0)  are 


3(/i,/2) 


d(x,y) 


x» 


-4  -2 
-6  -4 


=  4, 


3(/i,/2) 


-2 

0 

-4 

0 

Xo 

=  o, 


and 


3(/i,/2) 

0  -4 

3(z,jc) 

Xo 

0  -6 

=  0. 


The  requirement  in  Theorem  6.4.1  that  Fy  (Xo ,  Uo )  be  nonsingular  is  equivalent  to 

3(/i,/2,...,/M) 


3(Mi,  M2,  .  .  .  ,  Mm) 


^0. 


(Xo,U0) 

If  this  is  so  then,  for  a  fixed  j ,  Cramer's  rule  allows  us  to  write  the  solution  of  (6.4. 1 3)  as 

d(fi,f2,...,fi,...,fm) 
dui  _  _d(iti,u2  ,Xj,...,um) 

^""jgX^j  fm) 

3(mi,  U2,  .  .  .  ,Ui,  . . . ,  um) 

Notice  that  the  determinant  in  the  numerator  on  the  right  is  obtained  by  replacing  the  ith 
column  of  the  determinant  in  the  denominator,  which  is 


1  <  i  <  m, 


-  dj\  - 

"  dfi_  ~ 

dxj 

•  by 

dxj 

dfm 

_  dui  _ 

.  dxj  _ 

So  far  we  have  considered  only  the  problem  of  solving  a  continuously  differentiable 
system 

F(X,  U)  =  0    (F  :  R"+m  -+  Em)  (6.4.21) 

for  the  last  m  variables,  w i ,  w2,  •  •  • ,  in  terms  of  the  first  n,  X\ ,  x2j  . . . ,  xn .  This  was 
merely  for  convenience;  (6.4.21)  can  be  solved  near  (Xo,  Uo)  for  any  m  of  the  variables  in 
terms  of  the  other  n,  provided  only  that  the  Jacobian  of  f\,  f%,  fm  with  respect  to  the 
chosen  m  variables  is  nonzero  at  (Xo,  Uo).  This  can  be  seen  by  renaming  the  variables  and 
applying  Theorem  6.4.1. 
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f(x,y,z) 
g(x,y,z) 


Example  6.4.6  Let 

¥(x,y,z)  = 

be  continuously  differentiable  in  a  neighborhood  of  {xq,  yo,Zo)-  Suppose  that 

F(xo,J>o,Zo)  =  0 

and 


d(x,z)  . 

(xQ,yQ,ZQ) 

Then  Theorem  6.4.1  with  X  =  (y)  and  U  =  (x,  z)  implies  that  the  conditions 
f(x,y,z)  =  0,    g(x,y,z)  =  0,    x(y0)  =  x0,    z(y0)  =  z0, 


(6.4.22) 


(6.4.23) 


determine  x  and  z  as  continuously  differentiable  functions  of  y  near  yo .  Differentiating 
(6.4.23)  with  respect  to  y  and  regarding  x  and  z  as  functions  of  y  yields 

fxX'  +  fy  +  fzZ'  =  0 
gxX'  +  gy  +  gZz!  =  0. 


Rewriting  this  as 


fxX'  +  fzZ'  =  ~fy 

gxX'  +  gzZ,'  =  ~gy, 


and  solving  for  x'  and  z!  by  Cramer's  rule  yields 

~fy  fz 
~gy  gz 


and 


z  = 


fx  fz 

gx  gz 

fx  -fy 

gx  ~gy 


fx  fz 


d(f,g) 
d(y,z) 
'd(fg) 
d(x,z) 

d(fg) 
B(x,y) 
3(/.g)' 
d(x,z) 


(6.4.24) 


(6.4.25) 


Equation  (6.4.22)  implies  that  d(f,  g)/d(x,  z)  is  nonzero  if  y  is  sufficiently  close  to  yo- 


Example  6.4.7  LetX0  =  (1, 1,2)  and 

F(W)=K(wr 

g(x,y,z) 


6x  +  6y  +  4z3  -  44 
-x2-y2  +  8z-l4 
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Then  F(X0)  =  0, 


and 


B(f,g) 


6  12z2 
-2x  8 


Hfg) 

6  48 

(1,1,2) 

-2  8 

=  144  ^  0. 

Therefore,  Theorem  6.4.1  with  X  =  (y)  and  U  =  (x,  z)  implies  that  the  conditions 

f(x,y,z)  =  0,    g(x,y,z)  =  0, 

and 


x(l)=l,  z(l)=2, 


(6.4.26) 


determine  x  and  z  as  continuously  differentiable  functions  of  y  near  yo  =  1 .  From  (6.4.24) 
and  (6.4.25), 


and 


Kfg) 
Hy,z) 

6 
-2j 

12z2 
8 

2  +  yz.2 

Kfg) 

6 

12z2 

2  +  xz2 

3(jc,z) 

—2.x 

8 

9(/,g) 

6 

6 

— 2x 

-2y 

y-x 

3(/,g) 
3(x,z) 

6 
—2.x 

12z2 
8 

4  +  2jcz2' 

These  equations  hold  near  y  =  1.  Together  with  (6.4.26)  they  imply  that 

x'(l)  =  -l,    z'(l)  =  0. 

Example  6.4.8  Continuing  with  Example  6.4.7,  Theorem  6.4.1  implies  that  the  con- 
ditions 

fix,  y,  z)  =  0,    g(x,  y,  z)  =  0,    y(l)  =  1,    z(l)  =  2 
determine  y  and  z  as  functions  of  x  near  xq  =  1,  since 

Kfg) 


B(y,z) 


6  12z2 
2y  8 


and 


3(/,s) 


B(y,z) 


6  48 

-2  8 

(1,1,2) 

144  ^  0. 


However,  Theorem  6.4.1  does  not  imply  that  the  conditions 

f(x,y,z)  =  0,    g(x,y,z)  =  0,    x(2)  =  1,    y(2)  =  1 
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define  x  and  y  as  functions  of  z.  near  z.o  —  2,  since 

d(f,g) 


d(x,y) 


6 

2x 


6 

-2y 


and 


6  6 

3(x,y) 

(1,1,2) 

-2  -2 

0. 


We  close  this  section  by  observing  that  the  functions  u\,  Uj,  . . . ,  um  defined  in  Theo- 
rem 6.4. 1  have  higher  derivatives  if  f\ ,  f%, . . . ,  fm  do,  and  they  may  be  obtained  by  differ- 
entiating (6.4.13),  using  the  chain  rule.  (Exercise  6.4.17). 


Example  6.4.9  Suppose  that  it  and  v  are  functions  of  (x,  y)  that  satisfy 
f(x,  y,  u,  v)  =  x  —  u2  —  v2  +  9  =  0 
g(x,  y,  u,  v)  =  y  —  u2  +  v2  —  10  =  0. 

Then 


3(/,g)  _ 
9(m,  v) 

From  Theorem  6.4.1,  if  uv  ^  0,  then 
ux  = 


—2u  —2v 
—2m  2v 


=  —mv. 


1  d(f,g) 

1 

1  - 

-2v 

1 

8uv  d(x,  v) 

8mt; 

0 

2v 

4m' 

1  B(f,g) 

1 

0  - 

-2v 

1 

8uv  d(y,  v) 

%uv 

1 

2v 

4m' 

1  3(f,g) 

1 

—2m 

1 

1 

%uv  3(m,  x) 

8mu 

—2m 

0 

Av' 

1  3(J,g) 

1 

—2m 

0 

1 

8uv  9(m,  y) 

8uv 

—2m 

1 

Av' 

These  can  be  differentiated  as  many  times  as  we  wish.  For  example, 

_  ux  1 

4m2  16m3 
uy  1 


4m2  16m3' 

and 

-  —  -  1 
Vyx  ~  4^2  ~  16u2" 
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6.4  Exercises 


1.     Solve  forU  =  (u, . . . )  as  a  function  of  X  =  (x, . . .). 


(a) 


"  1 

1  " 

u 

'  1 

-1  " 

X 

'  o  " 

1 

-1 

V 

+ 

2 

-3 

_  y  _ 

0 

u  —  v  +  w+3x  +  2y  =  0 
(b)     —u  +  v  +  w—  x  +   y  =  0 
u  +  v  —  w         +   y  =  0 

/  s  3m  +  i>  +  j  =  sin* 
^  '       u  +  2v  +  x  =  sin  j 

2m  +  2v  +   w  +  2x  +  2y  +   z.  =  0 
(d)       m  -   d  +  2w  +  x-  y  +  2z.  =  0 
3m  +  2v  —   w  +  2>x  +  2y  —  z.  =  0 

2.  Suppose  that  X0  e  R"  and  U0  e  Rm.  Prove:  If  is  a  neighborhood  of  (X0,  U0) 
in  Rn+m,  there  is  a  neighborhood     of  X0  in  R"  such  that  (X,  Uo)  e  N\  if  X  e  JV. 

3.  Let  (Xo,Uo)  be  an  arbitrary  point  in  R"+m.  Give  an  example  of  a  function  F  : 
Rn+m  Rm  such  that  F  is  continuousiy  differentiable  on  R"+m,  F(X0,  U0)  =  0, 
Fu(X0,  U0)  is  singular,  and  the  conditions  F(X,  U)  =  0  and  U(X0)  =  Y0 

(a)  determine  U  as  a  continuously  differentiable  function  of  X  for  all  X; 

(b)  determine  U  as  a  continuous  function  of  X  for  all  X,  but  U  is  not  differentiable 
atX0; 

(c)  do  not  determine  U  as  a  function  of  X. 

4.  Let  m  =  u(x,  y)  be  determined  near  (1,  1)  by 

x2yu  +  2xy2u3  -  3x3y3u5  =  0,    u{\,  1)  =  1. 

Find  ux(l,  1)  and  uy{\,  1). 

5.  Let  m  =  u(x,  y,  z)  be  determined  near  (1,  1, 1)  by 

x2y5z,2u5  +  2xy2u3  -  3x3z,2u  =  0,    u(l,  1,  1)  =  1. 

Find  Ml,  1, 1),ML  !>  l),andMz(l,  1, 1). 

6.  Find  u(x0,  y0),  ux(x0,  y0),  and  uy(x0,  Jo)- 

(a)  2x2  +  y2  +  ueu  =  6,    (x0,  yo)  =  (1,2) 

(b)  u(x  +  \)  +  x(y  +  2)  +  y(u-2)  =  0,    (x0,  jo)  =  (~L  -2) 

(c)  1  -  eM  sin(x  +  y)  =  0,     (x0,  y0)  =  (ir/4,iz/4) 

(d)  x  log u  +  y  log x  +  u  log  j  =  0,    (x0,  jo)  =  (L  1) 


432    Chapter  6  Vector-Valued  Functions  of  Several  Variables 


7.  Find  u(xo,  yo),  ux{xq,  yo),  and  uy(xo,  yo)  for  all  continuously  differentiable  func- 
tions u  that  satisfy  the  given  equation  near  (xq,  yo). 

(a)  2x V  -  3uxy3  +  u2x4y3  =  0;    (x0,y0)  =  (1,1) 

(b)  cos  u  cosx  +  sinw  sinj>  =  0;    (xo,  yo)  =  (0,  n) 

8.  Suppose  that  U  =  (u,  v)  is  continuously  differentiable  with  respect  to  (x,  y,  z)  and 
satisfies 

x2  +  4y2  +  z2  -  2u2  +  v2  =  -4 
(x  +  z,)2  +  u  —  v  =  —  3 

and 

M(1,I,-1)  =  -2,    w(l,l,-l)  =  l. 

FindU'(l,  3,-1). 

9.  Let  m  and  u  be  continuously  differentiable  with  respect  to  x  and  satisfy 

u  +  2u2  +  v2  +  x2  +  2v  —  x  =  0 
xuv  +  e"  sin(v  +  x)  =  0 

and  u(0)  =  v(0)  =  0.  Find  w'(0)  and  j;'(0). 

10.  Let  U  =  (u,  v,  w)  be  continuously  differentiable  with  respect  to  (x,  y)  and  satisfy 

x2y  +  xy2  +  u2  —  (v  +  w)2  =  —3 
ex+y  —  u  —  v  —  w  =  —  2 
(x  +  y)2  +  u  +  v  +  w2  =  3 

andU(l,-l)  =  (1,2,0).  Find U'(l, -1). 

11.  Two  continuously  differentiable  transformations  U  =  (u,  v)  of  (x,  y)  satisfy  the 
system 

xyu  —  4yu  +  9xv  =  0 

2xy  -  3y2  +  v2  =  0 

near  (xo,  yo)  =  (1,  1)-  Find  the  value  of  each  transformation  and  its  differential 
matrix  at  (1,  1). 

12.  Suppose  that  u,  v,  and  w  are  continuously  differentiable  functions  of  (x,  y,  z)  that 
satisfy  the  system 

ex  cos  y  +  ez  cos  u  +  ev  cos  w  +  x  =  3 
ex  sin  y  +  ez  sin  u  +  ev  cos  w  =1 
ex  tan  y  +  ez  tan  w  +     tan  w  +  z,  =  0 

near  (x0,yo,Zo)  =  (0,  0,  0),  and  w(0,  0,  0)  =  u(0,0,0)  =  w(0,0,0)  =  0.  Find 
ux(0, 0, 0),  vx(0, 0, 0),  and  wx(0, 0, 0). 
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13.     LetF=  (/,  g,  h)  be  continuously  differentiable  in  a  neighborhood  of  Po  =  (xo,  yo,  Zo,  uo,  vo), 
F(P0)  =  0,  and 


Po 


d(y,  z,  u) 

Then  Theorem  6.4.1  implies  that  the  conditions 

F(x,  y,z,u,  v)  =  0,    y(x0,  v0)  =  u0,    z(x0,v0)  =  z.o,    u(x0,v0)  =  u0 

determine  y,  z,  and  u  as  continuously  differentiable  functions  of  (x,  v)  near  (xo,  i>o)- 
Use  Cramer's  rule  to  express  their  first  partial  derivatives  as  ratios  of  Jacobians. 

14.  Decide  which  pairs  of  the  variables  x,  y,  z,  u,  and  v  are  determined  as  functions  of 
the  others  by  the  system 

x  +  2y  +  3z,  +  u  +  6v  =  0 
2x  +  4y  +  z  +  2u  +  2v  =  0, 

and  solve  for  them. 

15.  Let  y  and  v  be  continuously  differentiable  functions  of  (x,  z,  u)  that  satisfy 

x2  +  Ay2  +  z.2  -  2u2  +  v2  =  -4 
(x  +  z,)2  +  u  —  v  =  —7> 

near  (xo,  Zo,  Mo)  =  (T  —1,  —2),  and  suppose  that 

3,(1,-1,-2)  =  1  «(1,-1,-2)=1. 

Find  ^(1,-1,-2)  and  u„(l,  -1,  -2). 

16.  Let  m,  u,  and  x  be  continuously  differentiable  functions  of  (w,  y)  that  satisfy 

x2y  +  xy2  +  u2  -  (v  +  w)2  =  -3 
ex+y  -u-v-w  =  -2 
(x  +  y)2  +  u  +  v  +  w2  =  3 

near  (u>o,  yo)  =  (0,  —1),  and  suppose  that 

w(0,-l)=l,     u(0,-l)  =  2,  x(0,-l)=l. 

Find  the  first  partial  derivatives  of  u,  v,  and  x  with  respect  to  y  and  w  at  (0,  —1). 

17.  In  addition  to  the  assumptions  of  Theorem  6.4.1,  suppose  that  F  has  all  partial 
derivatives  of  order  <  q  in  5.  Show  that  U  =  U(X)  has  all  partial  derivatives 
of  order  <  q  in  N . 
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18.  Calculate  all  first  and  second  partial  derivatives  at  (xo,  yo)  =  (1,1)  of  the  functions 
u  and  v  that  satisfy 

x2  +  y2  +  u2  +  v2  =  3        .     .  ,  . 

,       ,       ,          ,      w(l,  1)  =  0,    u(l,  1)  =  1. 
x+y+u+v=3, 

19.  Calculate  all  first  and  second  partial  derivatives  at  (xo,  yo)  =  (1,  —1)  of  the  func- 
tions u  and  d  that  satisfy 

u2  -  v2  =  x  -  y  -  2 

2uv  =  x  +  y-2,      "(1.-D  =  -1.     «(1.-D  =  1- 

20.  Suppose  that  f\,  f2,        fn  are  continuously  differentiable  functions  of  X  in  a 
region  5  in  W ,  <p  is  continuously  differentiable  function  of  U  in  a  region  T  of  W , 

(/,(X),/2(X),...,/„(X))er,  XeS, 

<H/i(X),/2(X),...,/„(X))  =  0,  XeS, 


and 


Show  that 


£>2.(U)>0,  UeT. 
./=1 

yi,/i,-,/,)=0>  XeS_ 

9(xi,x2, . . .  ,x„) 


CHAPTER  7 


Integrals  of  Functions 
of  Several  Variables 


IN  THIS  CHAPTER  we  study  the  integral  calculus  of  real-valued  functions  of  several 
variables. 

SECTION  7.1  defines  multiple  integrals,  first  over  rectangular  parallelepipeds  in  W  and 
then  over  more  general  sets.  The  discussion  deals  with  the  multiple  integral  of  a  function 
whose  discontinuities  form  a  set  of  Jordan  content  zero,  over  a  set  whose  boundary  has 
Jordan  content  zero. 

SECTION  7.2  deals  with  evaluation  of  multiple  integrals  by  means  of  iterated  integrals. 

SECTION  7.3  begins  with  the  definition  of  Jordan  measurability,  followed  by  a  derivation 
of  the  rule  for  change  of  content  under  a  linear  transformation,  an  intuitive  formulation  of 
the  rule  for  change  of  variables  in  multiple  integrals,  and  finally  a  careful  statement  and 
proof  of  the  rule.  This  is  a  complicated  proof. 

7.1  DEFINITION  AND  EXISTENCE  OF  THE  MULTIPLE  IN- 
TEGRAL 

We  now  consider  the  Riemann  integral  of  a  real-valued  function  /  defined  on  a  subset  of 
R",  where  n  >  2.  Much  of  this  development  will  be  analogous  to  the  development  in 
Sections  3.1-3  for  n  =  1,  but  there  is  an  important  difference:  for  n  =  1,  we  considered 
integrals  over  closed  intervals  only,  but  for  n  >  1  we  must  consider  more  complicated 
regions  of  integration.  To  defer  complications  due  to  geometry,  we  first  consider  integrals 
over  rectangles  in  W ,  which  we  now  define. 

Integrals  over  Rectangles 
The 

Si  x  S2  x  •  •  •  x  S„ 

of  subsets  Si,  S2,  . . . ,  S„  of  R  is  the  set  of  points  (x\,X2,  ■  ■  ■ ,  x„)  in  R"  such  that  x\  e 
Si ,  x2  e  S2, . . . ,  x„  e  S„ .  For  example,  the  Cartesian  product  of  the  two  closed  intervals 
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[ai,&i]  x  [a2,  b2]  =  {(x,  y)  |  a\  <  x  <  b\,  a2  <  y  <  b2} 
is  a  rectangle  in  R2  with  sides  parallel  to  the  x-  and  j-axes  (Figure  7.1.1). 

y 


> 

1  V 

a 

i  * 

1 

X 


Figure  7.1.1 

The  Cartesian  product  of  three  closed  intervals 

[a\,b\]  x  [a2,b2]  x  [a3,b3]  =  {{x,y,z)  \  a\  <  x  <bi,  a2  <  y  <b2,  a3  <  z  <  b3} 
is  a  rectangular  parallelepiped  in  R3  with  faces  parallel  to  the  coordinate  axes  (Figure  7.1.2). 


x 


Figure  7.1.2 
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Definition  7.1.1  A  coordinate  rectangle  R  in  R"  is  the  Cartesian  product  of  n  closed 
intervals;  that  is, 

R  =  [ai.ii]  x  [a2,b2]  x  ••■  x  [an,bn]. 

The  content  of  i?  is 

=  (Z>i  -  ai)(b2  -  a2)  -  ■  ■  (b„  -  a„). 

The  numbers  b\  —  a\,  b2  —  a2,  bn  —  an  are  the  edge  lengths  of  R.  If  they  are  equal, 
then  R  is  a  coordinate  cube.  If  ar  =  br  for  some  r,  then  V(R)  =  0  and  we  say  that  R  is 
degenerate;  otherwise,  R  is  nondegenerate.  ■ 

If  n  =  1,  2,  or  3,  then  V(R)  is,  respectively,  the  length  of  an  interval,  the  area  of  a 
rectangle,  or  the  volume  of  a  rectangular  parallelepiped.  Henceforth,  "rectangle"  or  "cube" 
will  always  mean  "coordinate  rectangle"  or  "coordinate  cube"  unless  it  is  stated  otherwise. 

If 

R  =  [ai,b\]  x  [a2,b2]  x  -x  [an,b„] 

and 

Pr:  ar  =  ar0  <  ar\  <  ■  ■  ■  <  arm,.  =  br 

is  a  partition  of  [ar,  br],  1  <  r  <  n,  then  the  set  of  all  rectangles  in  R"  that  can  be  written 

as 

[aijj-i.ai/J  x  [a2j2-i,a2j2]  x  ■•■  x  [a„jn-i,a„jn],     1  <  jT  <mr,     1  <  r  <  n, 

is  a  partition  of  R.  We  denote  this  partition  by 

P  =  PiX  P2x---x  Pn  (7.1.1) 

and  define  its  norm  to  be  the  maximum  of  the  norms  of  Pi,  P2,  Pn,  as  defined  in 
Section  3.1;  thus, 

||P ||  =  max{||Pi||,  ||P2||,...,||P„||}. 
Put  another  way,  ||  P  \\  is  the  largest  of  the  edge  lengths  of  all  the  subrectangles  in  P. 

Geometrically,  a  rectangle  in  R2  is  partitioned  by  drawing  horizontal  and  vertical  lines 
through  it  (Figure  7.1.3);  in  R3,  by  drawing  planes  through  it  parallel  to  the  coordinate  axes. 
Partitioning  divides  a  rectangle  R  into  finitely  many  subrectangles  that  we  can  number  in 
arbitrary  order  as  R\,  R2,       R^.  Sometimes  it  is  convenient  to  write 

P  ={RltR2,...,Rk} 


rather  than  (7.1.1). 
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Figure  7.1.3 

If  P  =  Pj  x^x-'-xf,  and  P'  =  ^  x  x--  -  x  are  partitions  of  the  same 
rectangle,  then  P'  is  a  refinement  of  P  if  P/  is  a  refinement  of  P,-,  1  <  i  <  «,  as  defined  in 
Section  3.1. 

Suppose  that  /  is  a  real-valued  function  defined  on  a  rectangle  R  inR",  P  =  {Pi,  R2,  ■  ■  ■ , 
is  a  partition  of  R,  and  Xy  is  an  arbitrary  point  in  Rj,l  <  j  <  k.  Then 

k 
7  =  1 

is  a  Riemann  sum  of  f  over  P .  Since  Xy  can  be  chosen  arbitrarily  in  Py ,  there  are  infinitely 
many  Riemann  sums  for  a  given  function  /  over  any  partition  P  of  P. 

The  following  definition  is  similar  to  Definition  3.1.1. 

Definition  7.1.2  Let  /  be  a  real-valued  function  defined  on  a  rectangle  R  in  R".  We 
say  that  /  is  Riemann  integrable  on  R  if  there  is  a  number  L  with  the  following  property: 
For  every  e  >  0,  there  is  a  S  >  0  such  that 

\a-L\  <e 

if  a  is  any  Riemann  sum  of  /  over  a  partition  P  of  P  such  that  ||  P  ||  <  S.  In  this  case,  we 
say  that  L  is  the  Riemann  integral  of  f  over  R,  and  write 

/  f(X)dX  =  L.  ■ 
Jr 

If  R  is  degenerate,  then  Definition  7. 1 .2  implies  that  fR  f  (X)  c?X  =  0  for  any  function  / 
defined  on  P  (Exercise  7.1.1).  Therefore,  it  should  be  understood  henceforth  that  whenever 
we  speak  of  a  rectangle  in  R"  we  mean  a  nondegenerate  rectangle,  unless  it  is  stated  to  the 
contrary. 
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The  integral  fR  f  (X)dX  is  also  written  as 

/  f(x,y)d(x,y)    (n  =  2),     /  f(x,y,z)d(x,y,z)    (n  =  3), 

JR  JR 

or 

/  fix  i,  x%, . . . ,  Xn)  d{x\,  X2,  ■  ■  ■ ,  Xn)    (n  arbitrary). 
JR 

Here  dX  does  not  stand  for  the  differential  of  X,  as  defined  in  Section  6.2.  It  merely 
identifies  xi,  X%,  xn,  the  components  of  X,  as  the  variables  of  integration.  To  avoid  this 
minor  inconsistency,  some  authors  write  simply  fR  f  rather  than  fR  f  (X)  dX. 

As  in  the  case  where  n  =  1,  we  will  say  simply  "integrable"  or  "integral"  when  we 
mean  "Riemann  integrable"  or  "Riemann  integral."  If  n  >  2,  we  call  the  integral  of  Defi- 
nition 7.1.2  a  multiple  integral;  for  n  =  2  and  n  =  3  we  also  call  them  double  and  triple 
integrals,  respectively.  When  we  wish  to  distinguish  between  multiple  integrals  and  the 
integral  we  studied  in  Chapter  (n  =  1),  we  will  call  the  latter  an  ordinary  integral. 

Example  7.1.1  Find  fR  f(x,  y)  d{x,  y),  where 

R  =  [a,  b]  x  [c,  d] 

and 

f(x,y)  =  x  +  y. 

Solution  Let  P\  and  P-i  be  partitions  of  [a,  b]  and  [c.  d\;  thus, 

P\  :  a  =  xq  <  x\  <  ■  ■  ■  <  xr  =  b    and     P-i  :  c  =  yo  <  y\  <  ■  ■  ■  <  ys  =  d, 
A  typical  Riemann  sum  of  /  over  P  =  P\  x  P2  is  given  by 

r  s 

a  =  +  wNxi  -  xi-i)(yj  -  yj-iX  (7-1-2) 

!=1  j  =  l 

where 

Xi-i  <  f-tj  <  Xi    and  ^  Wj  ^  X/-  (7.1.3) 

The  midpoints  of         Xi]  and  yj]  are 


Xi+xi-i  _       yj+yj-i  „.„ 

Xi  =   and    yj  =  — ,  (7.1.4) 


and  (7.1.3)  implies  that 


fe-x,-|<^<^<^  (7.1.5) 


and 
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Now  we  rewrite  (7.1.2)  as 

r  s 

i=\j=\ 

r  s 

+X!  X!  +      -7j)]  (xi  -  Xi-i)(yj  -  yj-i). 

i=\  j=i 

To  find  fR  f(x,  y)  d(x,  y)  from  (7.1.7),  we  recall  that 

r  s 

^T(xi  -  Xj-i)  =  b-a,    J2(yj  -  yj-i)  =  d  -c 
i=i  j=\ 

(Example  3.1.1),  and 


(7.1.7) 


(7.1.8) 


J2(xf-xf_1)  =  b2-a2,     Y,(y2j-y2j-i)  =  d2-c2  (7.1.9) 

1=1  7=1 

(Example  3.1.2). 

Because  of  (7.1.5)  and  (7.1.6)  the  absolute  value  of  the  second  sum  in  (7.1.7)  does  not 
exceed 


l^llEI>f-*f-i)to-^-i)  =  ll*l 
7=17=1 


^2(Xi  -Xi-l) 


.7=1 


7=1 


\P\\(b-a)(d-c) 


(see  (7.1.8)),  so  (7.1.7)  implies  that 


1=1  /  =  ! 


<  \\P\\(b-a)(d  -c).  (7.1.10) 


It  now  follows  that 

r  s 

J2J2xi(xi  -xt-i)(jj  -  yj-i)  = 
/=i/=i 


^XiiXi  -  Xj-i) 


i=l 


^(yj-yj-0 


=  (d  -  c)^2xi(xi  -  Xi-i)    (from  (7.1.8)) 


i=l 


d  —  c 


Ypn-Xi-x)       (from  (7.1.4)) 


</-c 


(=i 


-(62-a2) 


(from  (7.1.9)). 


Similarly, 


(=i/=i 
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Therefore,  (7.1.10)  can  be  written  as 

h -n  . 

<  \\P\\(b-a)(d  -c). 


a-^(b2-a2)-^(d2-c2} 


Since  the  right  side  can  be  made  as  small  as  we  wish  by  choosing  ||  P  ||  sufficiently  small, 

f  (x  +  y)  d(x,  y)  =  -  [(d  -  c)(b2  -  a2)  +  (b  -  a)(d2  -  c2)] . 
Jr  2 


Upper  and  Lower  Integrals 

The  following  theorem  is  analogous  to  Theorem  3.1.2. 

Theorem  7.1.3  // /  is  unbounded  on  the  nondegenerate  rectangle  R  in  W ,  then  f  is 
not  integrable  on  R. 

Proof  We  will  show  that  if  /  is  unbounded  on  R,  P  =  {Ri,  R2,  ■  ■  ■ ,  Rk}  is  any  parti- 
tion of  R,  and  M  >  0,  then  there  are  Riemann  sums  a  and  a'  of  /  over  P  such  that 

\a-a'\>M.  (7.1.11) 

This  implies  that  /  cannot  satisfy  Definition  7.1.2.  (Why?) 
Let 

k 
./  =  ! 

be  a  Riemann  sum  of  /  over  P .  There  must  be  an  integer  i  in  {1,2, ...  ,k}  such  that 

|/(X)  -  f(Xt)\  >  (7.1.12) 
for  some  X  in  Ri ,  because  if  this  were  not  so,  we  would  have 

|/(X)-/(Xy)|  <-^-,    XeRj,  l<j<k. 

V  (Kj  ) 

If  this  is  so,  then 

|/(X)|  =  \f(Xj)  +  f(X)  -  f(Xj)\  <  \f(Xj)\  +  |/(X)  -  f(Xj)\ 
M 

<  \f(Xj)\  +  ,    X&Rj,  \<i<k. 

However,  this  implies  that 

M 


\f{X)\  <  max  j  \f(Xj)\  +  I  1  <  j  <  k\  ,  XcR, 

which  contradicts  the  assumption  that  /  is  unbounded  on  R. 
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Now  suppose  that  X  satisfies  (7.1.12),  and  consider  the  Riemann  sum 

n 

a'  =  J^f&'jWRj) 

7  =  1 


over  the  same  partition  P ,  where 

"     |X.  ./  /. 


Since 

\a-o'\  =  \f(X)-f(Xi)\V(Ri), 
(7.1.12)  implies  (7.1.11).  H 

Because  of  Theorem  7.1.3,  we  need  consider  only  bounded  functions  in  connection  with 
Definition  7.1.2.  As  in  the  case  where  n  =  1,  it  is  now  convenient  to  define  the  upper 
and  lower  integrals  of  a  bounded  function  over  a  rectangle.  The  following  definition  is 
analogous  to  Definition  3.1.3. 

Definition  7.1.4  If  /  is  bounded  on  a  rectangle  R  in  W  and  P  =  {Rlt  R2, . . . ,  Rk} 
is  a  partition  of  R,  let 

Mj  =  sup  /(X),    mj  =  inf  /(X). 
XeRj  XeRi 

The  upper  sum  of  /  over  P  is 

k 

S(P)  =  J2MJV(Rj), 
and  the  upper  integral  of  f  over  R,  denoted  by 

Jr 

is  the  infimum  of  all  upper  sums.  The  lower  sum  of  f  over  P  is 

k 

s(P)  =  YjmjV{RJ), 
j=i 

and  the  lower  integral  of  f  over  R,  denoted  by 


f  f&)dX, 
Jr 


is  the  supremum  of  all  lower  sums. 


The  following  theorem  is  analogous  to  Theorem  3.1.4. 
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Theorem  7.1.5  Let  f  be  bounded  on  a  rectangle  R  and  let  P  be  a  partition  of  R. 
Then 

(a)  The  upper  sum  S(P)  of  f  over  P  is  the  supremum  of  the  set  of  all  Riemann  sums  of 
f  over  P. 

(b)  The  lower  sum  i(P)  of  f  over  P  is  the  infimum  of  the  set  of  all  Riemann  sums  of  f 
over  P. 

Proof   Exercise  7.1.5. 
If 

m  <  f{X)  <  M  forXinfl, 

then 

mV(R)  <  s(P)  <  S(P)  <  MV(R); 
therefore,  fR  f  (X)  dX  and  JR  f  (X)  dX  exist,  are  unique,  and  satisfy  the  inequalities 


/  /(X). 
Jr 


mV(R)  <  /  f(X)dX<MV{R) 
and 

mV(R)  <  /  f(X)dX  <  MV(R). 
Jr 

The  upper  and  lower  integrals  are  also  written  as 

/  f{x,y)d{x,y)    and     /  f(x,  y)  d(x,  y)    (n  =  2), 
Jr  Jr_ 

/  f(x,y,z)d(x,y,z)  and  /  f(x,  y,  z)  d(x,  y,  z)  (n  =  3), 
Jr  Jr 


or 


and 


/  f(xi,x2,  ...,xn)d(xi,x2,  ...,x„) 
Jr 

I  f(xi,X2,...,x„)d(xi,x2,...,x„)       (n  arbitrary). 
Jr 

Example  7.1.2  Find  fRf(x,y)d(x,y)  and  fRf(x,y)d(x,y),  with  R  =  [a,b]x 
[c,  d]  and 

f(x,y)  =  x  +  y, 

as  in  Example  7.1.1.  ■ 

Solution  Let  f  i  and  P2  be  partitions  of  [a,  b]  and  [c,  d\\  thus, 

P\  :  a  =  xq  <  x\  <  ■  ■  ■  <  xr  =  b    and     P2  '■  c  =  yo  <  y\  <  ■  ■  ■  <  ys  =  d . 
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The  maximum  and  minimum  values  of  /  on  the  rectangle  [x,-  _  i ,  x;  ]  x  [yj  _  i ,  yj  ]  are  X;  +  yj 
and  Xj-i  +  y       respectively.  Therefore, 

r  s 

t=i  j=i 


and 


By  substituting 


<p)  =  E  E^'-1  +  yj-dte  -  x'-^(yj  -  yj-J-  (7-L14) 

i=\ j  =  l 


xi  +  yj  =  ^[(xi  +  Xi-i)  +  {yj  +  yj-i)  +  (x*  -  x,_i)  +  (yj  -  yj-i)] 


into  (7.1.13),  we  find  that 


=         +E2  +  E3  +  5:4),  (7.1.15) 


where 


E,  =  £>?  -  xf_x)  -  =  (b2  -  a2)(d  -  c), 

1=1  7=1 
r  s 

s2  =      - x'-^  E^2  -      =  &  -  aW2  - c2)< 

(=1  y=i 

r  s 

E4  =  ^(x!  -xi-1)E^y-^-i)2  <  \\P\\(b  -  a)(d  -  c). 
(=1  7=1 

Substituting  these  four  results  into  (7.1.15)  shows  that 

/  <  S(P)  <  I  +  ||P||(i-fl)(rf-c), 

where 

_  (d  -  c)(b2  -  a2)  +  (b-  a){d2  -  c2) 


From  this,  we  see  that 


/  (x  +  y)d(x,y)  =  /. 
Jr 


After  substituting 

x,_i  +  =  i[(xi  +  Xj_i)  +  (x/  +  X/-0  -  (•*<  -  Xi-i)  -  (yj  -  yj-i)] 
into  (7.1.14),  a  similar  argument  shows  that 

/  -  \\P\\(b-a)(d  -c)  <s(P)  <  I, 
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so 


f  (x  +  y)d(x,y)  =  I 


J R 


We  now  prove  an  analog  of  Lemma  3.2.1. 


Lemma  7.1.6  Suppose  that  |/(X)|  <  M  ifX  is  in  the  rectangle 


R  =  [ai,bi]  x  [a2,b2]  x  ■•■  x  [a„,b„]. 


Let  P  =  P\  x  P2  x  •  •  •  x  P„  and  P'  =  P[  x  P'2  x  •  ■  •  x  P'n  be  partitions  of  R,  where  Pj 
is  obtained  by  adding  rj  partition  points  to  Pj ,  1  <  j  <n.  Then 


Proof  We  will  prove  (7.1.16)  and  leave  the  proof  of  (7.1.17)  to  you  (Exercise  7.1.7). 
First  suppose  that  P[  is  obtained  by  adding  one  point  to  P\,  and  P'-  =  Pj  for  2  <  j  <  n. 
If  Pr  is  defined  by 

Pr  :  ar  =  aro  <  ar\  <  ■  ■  ■  <  armr  =  br,     1  <  r  <  n, 

then  a  typical  subrectangle  of  P  is  of  the  form 

Rhh~Jn  =  [fll,y'l-l'aly'l]  X  [a2,y2-l  •  aVl\  X        X  lanjn-l'anj„]- 

Let  c  be  the  additional  point  introduced  into  P\  to  obtain  P[,  and  suppose  that 

a\,k-\  <  c  <  alk. 

If  ji  k,  then  Rjl  j2-jn  15  common  to  P  and  P' ,  so  the  terms  associated  with  it  in  S(P') 
and  S(P)  cancel  in  the  difference  S(P)  —  S(P').  To  analyze  the  terms  that  do  not  cancel, 
define 


(7.1.16) 


and 


(7.1.17) 


R 


R 


kjl-jn 
,(2) 

kjz-jn 


[auk-i,c]  x  [a2 

,72—1'  ^2/2]  x  ■  •  ■  x  [anjn—i,  anjn], 

[c,alk]  x  [a2 

,H~ !'  ^272]  X  ■  ■  ■  X  [anjn-i,  Clnj„], 

MkjT..  jn  =  sup  {/(X)  j  X  e  A'a /,.../„! 


(7.1.18) 


and 


(7.1.19) 
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Then  S(P)  —  S(P')  is  the  sum  of  terms  of  the  form 

[Mkj2...jn(alk  -  flu_0  -  M™„Jh(c  -  au_x)  -  .^u  -  c)]       (? ^  2Q) 

X(a2/2  —  fl2,./2-l)  '  '  '  (an/„  —  a„Jn-\). 

The  terms  within  the  brackets  can  be  rewritten  as 

(Mkh...Jn  -  M^...jn)(c  -  au_0  +  (Mkj2...jn  -  Mgl.jn)(alk  -  c),  (7.1.21) 

which  is  nonnegative,  because  of  (7.1.18)  and  (7.1.19).  Therefore, 

S(P')  <  S(P).  (7.1.22) 

Moreover,  the  quantity  in  (7.1.21)  is  not  greater  than  2M{a\k  —  ai^-i),  so  (7.1.20)  implies 
that  the  general  surviving  term  in  S(P)  —  S(P')  is  not  greater  than 

2M\\P\\(a2j2  -a2j2-i)---(anjn  -a„jn-i). 

The  sum  of  these  terms  as  jz,  ■  ■  ■ ,  jn  assume  all  possible  values  1  <  /;  <  mi,  2  <  i  <n, 
is 

2M\\P\\V(R) 

2M  ||  P  ||  (b2 -a2)---(bn-an)=       "    "  . 

b\  —  a\ 

This  implies  that 

2Af||P||VXR) 
S(P)  <  S(P')  +  — ,"    "  v  -. 

b\  —  a\ 

This  and  (7.1.22)  imply  (7.1.16)  for  r\  =  1  and  r%  =  ■  ■  ■  =  rn  =  0. 
Similarly,  if  r,-  =  1  for  some  i  in  {1, . . . ,  n)  and  rj  =  0  if  j  ^  i,  then 

2M\\P\\V(R) 


S(P)  <  S(P')  + 


To  obtain  (7.1.16)  in  the  general  case,  repeat  this  argument  r\  +  r2  +  ■  ■  ■  +  rn  times,  as  in 
the  proof  of  Lemma  3 .2. 1 .  H 

Lemma  7.1.6  implies  the  following  theorems  and  lemma,  with  proofs  analogous  to  the 
proofs  of  their  counterparts  in  Section  3.2. 

Theorem  7.1.7  If  f  is  bounded  on  a  rectangle  R,  then 

[  f(X)dX<  f  f(X)dX. 

JR  JR 

Proof   Exercise  7.1.8. 

The  next  theorem  is  analogous  to  Theorem  3.2.3. 

Theorem  7.1.8  Iff  is  integrable  on  a  rectangle  R,  then 

[  f(X)dX=  I  f{X)dX=  (  f(X)dX. 

JR  JR  JR 

Proof   Exercise  7.1.9. 
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Lemma  7.1.9  If  f  is  bounded  on  a  rectangle  R  and  e  >  0,  there  is  a  8  >  0  such  that 

[  f(X)dX<S(P)  <  [  f(X)dX  +  e 
Jr  Jr 

and 

[  f(X)dX>s(P)  >  f  f(X)dX-e 
Jr  Jr 

if\\P\\  <S. 

Proof   Exercise  7.1.10. 

The  next  theorem  is  analogous  to  Theorem  3.2.5. 

Theorem  7.1.10  Iff  is  bounded  on  a  rectangle  R  and 

f  f(X)dX=  [  f(X)dX  =  L. 
Jr  Jr 

then  f  is  integrable  on  R,  and 

[  f(X)dX  =  L. 
Jr 

Proof   Exercise  7.1.11. 

Theorems  7.1.8  and  7.1.10  imply  the  following  theorem,  which  is  analogous  to  Theo- 
rem 3.2.6. 

Theorem  7.1.11  A  bounded  function  f  is  integrable  on  a  rectangle  R  if  and  only  if 

[  f(X)dX=  [  f(X)dX. 
Jr  Jr 

The  next  theorem  translates  this  into  a  test  that  can  be  conveniently  applied.  It  is  analo- 
gous to  Theorem  3.2.7. 

Theorem  7.1.12  If  f  is  bounded  on  a  rectangle  R,  then  f  is  integrable  on  R  if  and 
only  if  for  every  e  >  0  there  is  a  partition  P  of  R  such  that 

S(P)-s{P)  <  €. 

Proof   Exercise  7.1.12. 

Theorem  7.1.12  provides  a  useful  criterion  for  integrability.  The  next  theorem  is  an 
important  application.  It  is  analogous  to  Theorem  3.2.8. 

Theorem  7.1.13  Iff  is  continuous  on  a  rectangle  R  inW ,  then  f  is  integrable  on  R. 
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Proof  Let  e  >  0.  Since  /  is  uniformly  continuous  on  R  (Theorem  5.2.14),  there  is  a 
8  >  0  such  that 

|/(X)-/(X')|<-1-  (7.1.23) 

V(K) 

ifXandX'  are  in  R  and  |X-X'|  <  8.  Let  P  =  {R^  R2, Rk}  be  a  partition  of  R  with 
||  f  ||  <  8/ \fn.  Since  /  is  continuous  on  R,  there  are  points  X/  and  X'-  in  Rj  such  that 

f(Xj)  =  Mj  =  sup  f(X)    and     /(X'y)  =       =   inf  /(X) 
XeRj 

(Theorem  5.2.12).  Therefore, 

n 

S(P) -s(P)  =  ^(/(X/)  -  f(X'j))V(Rj). 

.7=1 

Since  ||  P  ||  <  8/ y/n,  \Xj  -  X'j  |  <  5,  and,  from  (7.1 .23)  with  X  =  X/  and  X'  =  X^. , 

k 

S(p)-*(p)<^X>tfy)  =  €. 
Hence,  /  is  integrable  on  R,  by  Theorem  7.1.12. 


Sets  with  Zero  Content 

The  next  definition  will  enable  us  to  establish  the  existence  of  fR  f  (X)  dX  in  cases  where 
/  is  bounded  on  the  rectangle  R,  but  is  not  necessarily  continuous  for  all  X  in  R. 

Definition  7.1.14  A  subset  E  of  E"  has  zero  content  if  for  each  e  >  0  there  is  a  finite 
set  of  rectangles  T\,  T2,  ■  ■  ■ ,  Tm  such  that 

m 

E  C  (J  Tj  (7.1.24) 

./  =  ! 

and 

(7.1.25) 

./  =  ! 

Example  7.1.3  Since  the  empty  set  is  contained  in  every  rectangle,  the  empty  set  has 
zero  content.  If  E  consists  of  finitely  many  points  Xi,  X2, . . . ,  Xm,  thenX/  can  be  enclosed 
in  a  rectangle  Tj  such  that 

V(Tj)  <—,     1  <  j  <  m. 
m 

Then  (7.1.24)  and  (7.1.25)  hold,  so  E  has  zero  content. 
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Example  7.1.4  Any  bounded  set  E  with  only  finitely  many  limit  points  has  zero  con- 
tent. To  see  this,  we  first  observe  that  if  E  has  no  limit  points,  then  it  must  be  finite,  by  the 
Bolzano-Weierstrass  theorem  (Theorem  1.3.8),  and  therefore  must  have  zero  content,  by 
Example  7.1.3.  Now  suppose  that  the  limit  points  of  E  are  Xi,  X2,  . . . ,  Xm.  Let  R\,  R2, 
. . . ,  Rm  be  rectangles  such  that  X,  e  Rf  and 

V(Ri)  <  — .     1  <  /  <m.  (7.1.26) 

2m 

The  set  of  points  of  E  that  are  not  in  UrJ_1Rj  has  no  limit  points  (why?)  and,  being 
bounded,  must  be  finite  (again  by  the  Bolzano-Weierstrass  theorem).  If  this  set  contains  p 
points,  then  it  can  be  covered  by  rectangles  R[,  R'2,       R'p  with 


Now, 


(7.1.27) 


and,  from  (7.1.26)  and  (7.1.27), 

m  p 

Y,v(Ri)  +  YJnR'j)<t- 
;=i  j=\ 

Example  7.1.5  If  /  is  continuous  on  [a,  b],  then  the  curve 

y  =  f(x),    a  <x  <b  (7.1.28) 

(that  is,  the  set  {(x,  y)  |  y  =  f(x),  a  <  x  <  /?}),  has  zero  content  in  M2.  To  see  this, 
suppose  that  e  >  0,  and  choose  S  >  0  such  that 

\f(x)- f(x')\  <  e    if   x,x'€[a,b]    and    \x-x'\<8.  (7.1.29) 

This  is  possible  because  /  is  uniformly  continuous  on  [a,  b]  (Theorem  2.2.12).  Let 

P  :  a  =  xo  <  x\  <  ■■•  <  xn  =b 

be  a  partition  of  [a,  b]  with  \\P\\  <  S,  and  choose  £1,      •••>£»  so  that 

Xi-i  <  &  <  Xi,     1  <  i  <  n. 

Then,  from  (7.1.29), 

\f(x)-Mi)\<€     if  Xi-X<X<Xi. 

This  means  that  every  point  on  the  curve  (7.1.28)  above  the  interval  [Xi—i,  Xi\  is  in  a  rect- 
angle with  area  2e(x,-  —  (Figure  7.1.4).  Since  the  total  area  of  these  rectangles  is 
2e(b  —  a),  the  curve  has  zero  content. 


450    Chapter  7  Integrals  of  Functions  of  Several  Variables 


Figure  7.1.4 

The  next  lemma  follows  immediately  from  Definition  7.1.14. 

Lemma  7.1.15  The  union  of  finitely  many  sets  with  zero  content  has  zero  content. 

The  following  theorem  will  enable  us  to  define  multiple  integrals  over  more  general 
subsets  ofR". 

Theorem  7.1.16  Suppose  that  f  is  bounded  on  a  rectangle 

R  =  [ai,bi]  x  [a2,b2]  x  ■•■  x  [a„,b„]  (7.1.30) 
and  continuous  except  on  a  subset  E  of  R  with  zero  content.  Then  f  is  integrable  on  R. 

Proof  Suppose  that  e  >  0.  Since  E  has  zero  content,  there  are  rectangles  T\,  T%,  . . . , 
Tm  such  that 


E  C  |J  Tj 


(7.1.31) 


./=i 


and 


J2v(Tj)<€.  (7.1.32) 
j=i 

We  may  assume  that  T\,  T%,  . . .,  Tm  are  contained  in  R,  since,  if  not,  their  intersections 
with  R  would  be  contained  in  R,  and  still  satisfy  (7.1.31)  and  (7.1.32).  We  may  also  assume 
that  if  T  is  any  rectangle  such  that 


T  Pi  I  U  Ti  I  =  0-    then     TnE  = 


(7.1.33) 
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since  if  this  were  not  so,  we  could  make  it  so  by  enlarging  T\,  T%,  Tm  slightly  while 
maintaining  (7.1.32).  Now  suppose  that 

Tj  =  [ay, by]  x  [a2j,b2j]  x  •■  •  x  [a„j,bnj],     1  <  j  <  m, 

let  Pjo  be  the  partition  of  [a; ,  bi]  (see  (7.1 .30))  with  partition  points 

&i,bi,ai\,bi\,aii,  bi%,...,  a;m,  bim 

(these  are  not  in  increasing  order),  1  <  i  <n,  and  let 

A)  =  Pw  x  P20  x  •■■  x  Pn0. 

Then  Pq  consists  of  rectangles  whose  union  equals  U™=1Tj  and  other  rectangles  T[,  T^, 
T'k  that  do  not  intersect  E.  (We  need  (7.1.33)  to  be  sure  that  T-  n  E  =  0, 1  <  i  <  k.) 
If  we  let 

m  k 

B=\jTj    and    C  =  (J  7}', 
j=i  i=i 

then  R  =  B  U  C  and  /  is  continuous  on  the  compact  set  C .  If  P  =  {R\,  R2,  ■  ■  ■ ,  Rk}  is 
a  refinement  of  i^,  then  every  subrectangle  Rj  of  P  is  contained  entirely  in  B  or  entirely 
in  C .  Therefore,  we  can  write 

S(P)-s(P)  =  UtiMj  -mj)V(Rj)  +  H2(Mj  -mj)V(Rj),  (7.1.34) 

where  Si  and  S2  are  summations  over  values  of  j  for  which  Rj  C  B  and  Rj  C  C, 
respectively.  Now  suppose  that 

\f(X)\<M  forXintf. 

Then 

m 

Si(M7-  -  mj)V(Rj)  <  2M  ZiV(Rj)  =  2M  ^  7(7//)  <  2Me,  (7.1.35) 

7  =  1 

from  (7.1.32).  Since  /  is  uniformly  continuous  on  the  compact  set  C  (Theorem  5.2.14), 
there  is  a  S  >  0  such  that  Mj  —  itij  <  e  if  \\P\\  <  S  and  Rj  C  C;  hence, 

E2(M/  -  mj)V{Rj)  <  e£2  7(*y)  <  eF(/?). 

This,  (7.1.34),  and  (7.1.35)  imply  that 

S{P)  -s(P)  <  [2M  +  V(R)]e 

if  ||  P  ||  <  S  and  P  is  a  refinement  of  Po-  Therefore,  Theorem  7.1.12  implies  that  /  is 
integrable  on  R.  TJ 
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Example  7.1.6  The  function 


f(x,y)  = 


[x  +  y,    0  <  x  <  y  <  1, 


1 5.  0  <  y  <  x  <  1, 

is  continuous  onfi=  [0,  1]  x  [0, 1]  except  on  the  line  segment 

y  =  X,      0  <  X  <  1 

(Figure  7.1.5).  Since  the  line  segment  has  zero  content  (Example  7.1.5),  /  is  integrable  on 
R. 


Figure  7.1.5 


Integrals  over  More  General  Subsets  of  R" 

We  can  now  define  the  integral  of  a  bounded  function  over  more  general  subsets  of  R" . 

Definition  7.1.17  Suppose  that  /  is  bounded  on  a  bounded  subset  of  S  of  R",  and  let 

j/(X),  XeS, 


fs(X) 


(0,  X^S. 

Let  R  be  a  rectangle  containing  S.  Then  the  integral  of  f  over  S  is  defined  to  be 

/  f(X)dX=  [  fs(X)dX 
Js  Jr 

if  fR  fs(X)  dX  exists. 


(7.1.36) 
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To  see  that  this  definition  makes  sense,  we  must  show  that  if  Ri  and  R2  are  two  rect- 
angles containing  5  and  fR  fs(X)dX  exists,  then  so  does  JR  fs(X)dX,  and  the  two 
integrals  are  equal.  The  proof  of  this  is  sketched  in  Exercise  7.1.27. 

Definition  7.1.18  If  S  is  a  bounded  subset  of  R"  and  the  integral  L  dX  (with  inte- 
grand /  =  1)  exists,  we  call  fs  dX  the  content  (also,  area  if  n  =  2  or  volume  if  n  =  3) 
of  5,  and  denote  it  by  V(S);  thus, 

V(S)  =  j  dX. 

Theorem  7.1.19  Suppose  that  f  is  bounded  on  a  bounded  set  S  and  continuous  ex- 
cept on  a  subset  E  of  S  with  zero  content.  Suppose  also  that  dS  has  zero  content.  Then  f 
is  integrable  on  S. 

Proof  Let  fs  be  as  in  (7.1.36).  Since  a  discontinuity  of  fs  is  either  a  discontinuity  of  / 
or  a  point  of  3S,  the  set  of  discontinuities  of  fs  is  the  union  of  two  sets  of  zero  content  and 
therefore  is  of  zero  content  (Lemma  7.1.15).  Therefore,  fs  is  integrable  on  any  rectangle 
containing  5  (from  Theorem  7.1.16),  and  consequently  on  S  (Definition  7.1.17).  H 

Differentiable  Surfaces 

Differentiable  surfaces,  defined  as  follows,  form  an  important  class  of  sets  of  zero  content 


Definition  7.1.20  A  differentiable  surface  S  in  R"  («  >  1)  is  the  image  of  a  compact 
subset  D  of  Rm,  where  m  <  n,  under  a  continuously  differentiable  transformation  G  : 
Rm      R".  If  m  =  1,  S  is  also  called  a  differentiable  curve. 


Example  7.1.7  The  circle 


{{x,y)\x2  +  y2  =  9} 


is  a  differentiable  curve  in  R2,  since  it  is  the  image  of  D  =  [0,  2n]  under  the  continuously 
differentiable  transformation  G  :  R  — >•  R2  defined  by 


X  =  G(0)  = 


3  cos  9 
3  sin# 


Example  7.1.8  The  sphere 

{(x,y,z)  j  x2  +  y2  +  z2  =  4} 
is  a  differentiable  surface  in  R3,  since  it  is  the  image  of 

D  =  {(6, 4>)  j  0  <  6  <  2tt,  -n/2  <  <p  <  ir/2} 
under  the  continuously  differentiable  transformation  G  :  R2  — ►  R3  defined  by 


X  =  G(0,  <p)  = 


2  cos  6  cos ( 
2  sin  (9  cos  ^ 
2  sin0 
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Example  7.1.9  The  set 

{(xi,  X2,  X3,  X4)  \  Xi  >  0  (z  =  1, 2,  3, 4),  Xi  +  x%  =  1,  x$  +  X4  =  l) 

is  a  differentiable  surface  in  R4,  since  it  is  the  image  of  D  =  [0,  1]  x  [0,  1]  under  the 
continuously  differentiable  transformation  G  :  ]R2  — >  M4  defined  by 

X  =  G(u,v)  = 


11 

1  —  u 

V 

l-v 


Theorem  7.1.21  A  differentiable  surface  in  K"  has  zero  content. 

Proof  Let  S,  D,  and  G  be  as  in  Definition  7.1.20.  From  Lemma  6.2.7,  there  is  a 
constant  M  such  that 

|G(X)-G(Y)|  <  M|X-Y|    if   X,YeO.  (7.1.37) 
Since  D  is  bounded,  D  is  contained  in  a  cube 

C  =  [ai,bi]  x  [a2,b2]  x  -  x  [am,bm], 

where 

bj  —  ai  =  L,     1  <  i  <  m. 

Suppose  that  we  partition  C  into  Nm  smaller  cubes  by  partitioning  each  of  the  intervals 
[a, ,  bi]  into  N  equal  subintervals.  Let  Ri ,  R2, R^be  the  smaller  cubes  so  produced  that 
contain  points  of  D,  and  select  points  Xi,  X2,  . . . ,  X^  such  that  X,  e  D  fl  i?,,  1  <  i  <k. 
If  Y  6  D  n     ,  then  (7.1.37)  implies  that 


|G(X/)-G(Y)|  <M|Xi-Y|. 
Since  X,-  and  Y  are  both  in  the  cube  R,  with  edge  length  L/N, 


(7.1.38) 


Y  < 


N 


This  and  (7.1.38)  imply  that 


|G(X,-)-G(Y)|  < 


MLjm 


N 


which  in  turn  implies  that  G(Y)  lies  in  a  cube  Ri  in  M"  centered  at  G(X,- ),  with  sides  of 
length  2ML^/m/ N .  Now 


^     ~       ,  (2MLJm\ 
J2V(Ri)=k(^-)  N" 

i  =  l  ^ 


(IMLJm 


N 


j    =  (2MLjm)"Nm-n. 


Since  n  >  m,  we  can  make  the  sum  on  the  left  arbitrarily  small  by  taking  ./V  sufficiently 
large.  Therefore,  S  has  zero  content.  H 

Theorems  7.1.19  and  7.1.21  imply  the  following  theorem. 
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Theorem  7.1.22  Suppose  that  S  is  a  bounded  set  in  W ,  with  boundary  consisting  of 
a  finite  number  of  differentiable  surfaces.  Let  f  be  bounded  on  S  and  continuous  except 
on  a  set  of  zero  content.  Then  f  is  integrable  on  S. 

Example  7.1.10  Let 

S  =  {(x,y)\x2  +  y2  =  1,  x>0}; 

thus,  S  is  bounded  by  a  semicircle  and  a  line  segment  (Figure  7.1.6),  both  differentiable 
curves  in  M2.  Let 

(    (1  -x2  -y2)1/2,    (x,  y)  €  S,  y  >  0, 
f(x,  y)  =  I 

(-(1  -x2  -y2)1/2,    (x,y)  <e  S,  y  <0. 

Then  /  is  continous  on  S  except  on  the  line  segment 

y  =  0,    0  <  x  <  1, 

which  has  zero  content,  from  Example  7.1.5.  Hence,  Theorem  7.1.22  implies  that  /  is 
integrable  on  S . 


Figure  7.1.6 


Properties  of  Multiple  Integrals 

We  now  list  some  theorems  on  properties  of  multiple  integrals.  The  proofs  are  similar  to 
those  of  the  analogous  theorems  in  Section  3.3. 

Note:  Because  of  Definition  7.1.17,  if  we  say  that  a  function  /  is  integrable  on  a  set  S, 
then  S  is  necessarily  bounded. 


456    Chapter  7  Integrals  of  Functions  of  Several  Variables 

Theorem  7.1.23  //  /  and  g  are  integrable  on  S,  then  so  is  f  +  g,  and 
f  (f  +  g)(X)dX=  f  f(X)dX+  f  g(X)dX. 

J  S  i/5  J  S 

Proof   Exercise  7.1.20. 

Theorem  7.1.24  If  f  is  integrable  on  S  and  c  is  a  constant,  then  c f  is  integrable  on 
S,  and 

f  (cf)(X)dX  =  c  f  f(X)dX. 
Js  Js 

Proof   Exercise  7.1.21. 

Theorem  7.1.25  Iff  and  g  are  integrable  on  S  and  f  (X)  <  g(X)  for  X  in  S,  then 

(  f(X)dX<  [  g(X)dX. 
Js  Js 

Proof   Exercise  7.1.22. 

Theorem  7.1.26  If  f  is  integrable  on  S,  then  so  is  \f\,  and 

\f  f(X)dx\  <  f  \f(X)\dX. 
\Js  I  Js 

Proof   Exercise  7.1.23. 

Theorem  7.1.27  If  f  and  g  are  integrable  on  S,  then  so  is  the  product  fg. 
Proof   Exercise  7.1.24. 

Theorem  7.1.28  Suppose  that  u  is  continuous  and  v  is  integrable  and  nonnegative  on 
a  rectangle  R.  Then 

I  u(X)v(X)dX  =  w(X0)  /  v(X)dX 
Jr  Jr 

for  some  Xq  in  R. 
Proof   Exercise  7.1.25. 

Lemma  7.1.29  Suppose  that  S  is  contained  in  a  bounded  set  T  and  f  is  integrable 
on  S.  Then  fs  (see  (7.1.36))  is  integrable  on  T,  and 

jTfs{X)dX  =  j  f(X)dX. 
Proof   From  Definition  7.1.17  with  /  and  S  replaced  by  fs  and  T, 


Section  7.1  Definition  and  Existence  of  the  Multiple  Integral  457 


(fs)T(X) 


fs(X),  XeT, 


0. 


XgT. 


Since  S  C  T ,  (fs)r  =  fs-  (Verify.)  Now  suppose  that  R  is  a  rectangle  containing  T . 
Then  R  also  contains  S  (Figure  7.1.7), 


Figure  7.1.7 


/  /(X)  dX  =  I  fs(X)  dX  (Definition  7.1.17,  applied  to  /  and  S) 
Js  Jr 

=  [  (fs)r(X)dX  (since  (JS)T  =  fs) 
Jr 

=  j  fs(X)  dX  (Definition  7.1.17,  applied  to  fs  and  T), 
which  completes  the  proof. 


H 


Theorem  7.1.30  Iff  is  integrable  on  disjoint  sets  S\  and  S2,  then  f  is  integrable  on 
S\  U  S2,  and 

f  f(X)dX=  [  f(X)dX+  [  f(X)dX.  (7.1.39) 
JSlUS2  J  Si  Js2 


Proof   Fori  =  1,2,  let 


A(X)  = 


;/(X),  XeS,- 


[0,  XgSi. 

From  Lemma  7.1.29  with  S  =  Si  and  T  =  Si  U  S2,  /s,  is  integrable  on  Si  U  S2,  and 
/        fSj(X)dX=  [  f(X)dX,     i  =  1,2. 

JS1US2  JSi 

Theorem  7.1.23  now  implies  that  fg1  +  fs2  is  integrable  on  Si  U  S2  and 

f  C/Si  +  fs2)(X)dX=  f  f(X)dX+  I  f{X)dX.  (7.1.40) 
JSiUSi  Js,  Js2 
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Since  Si  n  S2  =  0, 

(fSl  +  fS2)  (X)  =  fSl  (X)  +  fS2(X)  =  f(X).    X  e  Si  U  S2. 

Therefore,  (7. 1 .40)  implies  (7.1.39).  H 
We  leave  it  to  you  to  prove  the  following  extension  of  Theorem  7.1 .30.  (Exercise  7.1.31  (b)). 

Corollary  7.1.31  Suppose  that  f  is  integrable  on  sets  Si  and  S2  such  that  S\  H  S2 
has  zero  content.  Then  f  is  integrable  on  Si  U  S2,  and 

[        f(X)dX=  [   f(X)dX+  f  f(X)dX. 
JS1US2  Jst  Js2 

Example  7.1.11  Let 

51  =  {(x,  y)  I  0  <  x  <  1,  0  <  y  <  1  +  x) 

and 

52  =  {(x,  y)  I  -  1  <  x  <  0,  0  <  y  <  1  -  x) 

(Figure  7.1.8). 

y 


s 


Figure  7.1.8 


Then 


Sins2  =  {(o,y)|o<y<i} 

has  zero  content.  Hence,  Corollary  7. 1 .3 1  implies  that  if  /  is  integrable  on  Si  and  S2,  then 
/  is  also  integrable  over 

S  =  Si  U  S2  =  {(x,  y)  \  -  1  <  x  <  1,  0<j<1  + 

(Figure  7.1.9),  and 

/       f(X)dX=  [  f(X)dX+  f  f{X)dX. 
Jslus2  J  Si  Js2 
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v=  1  +  x 


->-  X 


y  =  1  —  x 


Figure  7.1.9 

We  will  discuss  this  example  further  in  the  next  section. 

7.1  Exercises 


1.  Prove:  If  R  is  degenerate,  then  Definition  7.1.2  implies  that  fR  f(X)  dX  =  0  if  / 
is  bounded  on  R. 

2.  Evaluate  directly  from  Definition  7. 1 .2. 

(a)  fR(3x  +  2y)d(x,  y);    R  =  [0, 2]  x  [1, 3] 

(b)  fRxyd(x,y);    R  =  [0,  1]  x  [0,  1] 

3.  Suppose  that  f  f  (x)  dx  and      g(y)  dy  exist,  and  let  R  =  [a,  b]  x  [c,  d\.  Criticize 
the  following  "proof"  that  fR  f  (x)g(y)  d(x,  y)  exists  and  equals 


(See  Exercise  7.1.30  for  a  correct  proof  of  this  assertion.) 
"Proof."  Let 

Pi  :  a  =  xo  <  x\  <  ■  ■  ■  <  xr  =  b    and    P2  '■  c  =  yo  <  yi  <  ■  ■  ■  <  ys  =  d 

be  partitions  of  [a,  b]  and  [c,  d],  and  P  =  P\  x  P2.  Then  a  typical  Riemann  sum  of 
fg  over  P  is  of  the  form 

r  s 

0-  =  X!  X!  f(^i)g(j)j)(xi  -  Xi-i)(yj  -  yj-i)  =  oxa%, 
(=1 j=i 

where 

Y  S 

01  =  X!         _  x<-i)  and  CT2  =  X!  s(nj)iyj  -  yj-i) 
i=i  j=i 
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7. 

8. 

9. 
10. 
11. 
12. 
13. 
14. 


15. 


are  typical  Riemann  sums  of  /  over  [a,  b]  and  g  over  [c,<i].  Since  /  and  g  are 
integrable  on  these  intervals, 


"/*'< 

J  a 


x)  dx 


and 


02 


-r 


can  be  made  arbitrarily  small  by  taking  and  \P%\  sufficiently  small.  From 

this,  it  is  straightforward  to  show  that 


-(/: 


/ (x)  dx 


g(y)  dy 


can  be  made  arbitrarily  small  by  taking  ||  P  ||  sufficiently  small.  This  implies  the 
stated  result. 

4.  Suppose  that  f(x,y)  >  0  on  R  =  [a,b]  x  [c,  d].  Justify  the  interpretation  of 
Jr  f(x>  y)  d(x,  y),  if  it  exists,  as  the  volume  of  the  region  in  R3  bounded  by  the 
surfaces  z  =  f(x,  y)  and  the  planes  z  =  0,  x  =  a,  x  =  b,  y  =  c,  and  y  =  d. 

5.  Prove  Theorem  7.1.5.  Hint:  See  the  proof  of  Theorem  3.1.4. 

6.  Suppose  that 


f(x,y)  = 


0  if  x  and  y  are  rational, 

1  if  x  is  rational  and  y  is  irrational, 

2  if  x  is  irrational  and  y  is  rational, 

3  if  x  and  y  are  irrational. 


Find 


(  /(*. 
Jr 


y)d(x,y)  and 


f  fix, 
Jr 


y)d(x,y)    if    R  =  [a,  b]  x  [c,  d\. 


Prove  Eqn.  (7.1.17)  of  Lemma  7.1.6. 

Prove  Theorem  7.1.7  Hint:  See  the  proof  of  Theorem  3.2.2. 

Prove  Theorem  7.1.8  Hint:  See  the  proof  of  Theorem  3.2.3. 

Prove  Lemma  7.1.9  Hint:  See  the  proof  of  Lemma  3.2.4. 

Prove  Theorem  7.1.10  Hint:  See  the  proof  of  Theorem  3.2.5. 

Prove  Theorem  7.1.12  Hint:  See  the  proof  of  Theorem  3.2.7. 

Give  an  example  of  a  denumerable  set  in  M2  that  does  not  have  zero  content. 

Prove: 

(a)  If  Si  and  S2  have  zero  content,  then  Si  U  S2  has  zero  content. 

(b)  If  Si  has  zero  content  and  S2  C  Si,  then  S2  has  zero  content. 

(c)  If  S  has  zero  content,  then  S  has  zero  content. 
Show  that  a  degenerate  rectangle  has  zero  content. 
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16.  Suppose  that  /  is  continuous  on  a  compact  set  S  in  R".  Show  that  the  surface 
Z  =  f  (X),  X  e  5,  has  zero  content  in  R"+1 .  Hint:  See  Example  7.1.5. 

17.  Let  5  be  a  bounded  set  such  that  S  fl  35  does  not  have  zero  content. 

(a)  Suppose  that  /  is  defined  on  S  and  / (X)  >  p  >  0  on  a  subset  T  of  5  fl  dS 
that  does  not  have  zero  content.  Show  that  /  is  not  integrable  on  S. 

(b)  Conclude  that  V(S)  is  undefined. 

18.  (a)    Suppose  that  h  is  bounded  and  h(X)  =  0  except  on  a  set  of  zero  content. 

Show  that  fs  h(X)  dX  =  0  for  any  bounded  set  S. 
(b)    Suppose  that  fs  f  (X)  dX  exists,  g  is  bounded  on  5,  and  / (X)  =  g(X)  except 
for  X  in  a  set  of  zero  content.  Show  that  g  is  integrable  on  S  and 

/  g(X)dX=  f  f(X)dX. 
Js  Js 

19.  Suppose  that  /  is  integrable  on  a  set  S  and  So  is  a  subset  of  S  such  that  3So  has 
zero  content.  Show  that  /  is  integrable  on  So- 

20.  Prove  Theorem  7.1.23  Hint:  See  the  proof  of  Theorem  3.3.1. 

2 1 .  Prove  Theorem7.1.24. 

22.  Prove  Theorem  7.1.25  Hint:  See  the  proof  of  Theorem  3.3.4. 

23.  Prove  Theorem  7.1.26  Hint:  See  the  proof  of  Theorem  3.3.5. 

24.  Prove  Theorem  7.1.27  Hint:  See  the  proof  of  Theorem  3.3.6. 

25.  Prove  Theorem  7.1.28  Hint:  See  the  proof  of  Theorem  3.3.7. 

26.  Prove:  If  /  is  integrable  on  a  rectangle  R,  then  /  is  integrable  on  any  subrectangle 
of  R.  Hint:  Use  Theorem  7.1.12;  see  the  proof  of  Theorem  3.3.8. 

27.  Suppose  that  R  and  R  are  rectangles,  R  C  R,  g  is  bounded  on  R,  and  g(X)  =  0  if 
X  $  R. 

(a)  Show  that  g(X)  dX  exists  if  and  only  if  fR  g(X)dX  exists  and,  in  this 
case, 

[g(X)dX=  [  g(X)dX. 
Jr  J R 

HINT:  Use  Exercise  7.1.26. 

(b)  Use  (a)  to  show  that  Definition  7. 1 . 17  is  legitimate;  that  is,  the  existence  and 
value  of  fs  f(X)  dX  does  not  depend  on  the  particular  rectangle  chosen  to 
contain  S . 

28.  (a)    Suppose  that  /  is  integrable  on  a  rectangle  R  and  P  =  {R\,  R2, . . . ,  Rk}  is 

a  partition  of  R.  Show  that 


HINT:  Use  Exercise  7.1.26. 
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(b)    Use  (a)  to  show  that  if  /  is  continuous  on  R  and  P  is  a  partition  of  R,  then 
there  is  a  Riemann  sum  of  /  over  P  that  equals  fR  f  (X)  dX. 

29.  Suppose  that  /  is  continuously  differentiable  on  a  rectangle  R.  Show  that  there  is  a 
constant  M  such  that 

or  -  I  f(X)dX  <  M  ||P  || 
Jr 

if  <t  is  any  Riemann  sum  of  /  over  a  partition  P  of  R.  Hint:  Use  Exercise  7.1 .28(b) 
and  Theorem  5.4.5. 

30.  Suppose  that  f  f  (x)  dx  and      g(y)  dy  exist,  and  let  R  =  [a,  b]  x  [c,  rf]. 

(a)  Use  Theorems  3.2.7  and  7.1.12  to  show  that 

/  f(x)d(x,y)    and      /  g(y)d(x,y) 
Jr  Jr 

both  exist. 

(b)  Use  Theorem  7.1.27  to  prove  that  fR  f  (x)g(y)  d(x,  y)  exists. 

(c)  Justify  using  the  argument  given  in  Exercise  7.1.3  to  show  that 

jRf(x)g(y)d{x,y)  =        f(x)dxj  (j*  g(y)dy)j  . 

31.  (a)    Suppose  that  /  is  integrable  on  S  and  So  is  obtained  by  removing  a  set  of 

zero  content  from  S.  Show  that  /  is  integrable  on  So  and  fs  f(X)dX  = 
JsfWdX. 
(b)   Prove  Corollary  7.1.31. 

7.2  ITERATED  INTEGRALS  AND  MULTIPLE  INTEGRALS 


Except  for  very  simple  examples,  it  is  impractical  to  evaluate  multiple  integrals  directly 
from  Definitions  7.1.2  and  7.1.17.  Fortunately,  this  can  usually  be  accomplished  by  evalu- 
ating 7i  successive  ordinary  integrals.  To  motivate  the  method,  let  us  first  assume  that  /  is 
continuous  on  R  =  [a,  b]  x  [c,  d].  Then,  for  each  y  in  [c,  d],  f(x,  y)  is  continuous  with 
respect  to  x  on  [a,  b],  so  the  integral 

F(y)=  f  f(x,y)dx 

J  a 

exists.  Moreover,  the  uniform  continuity  of  /  on  R  implies  that  F  is  continuous  (Exer- 
cise 7.2.3)  and  therefore  integrable  on  [c,  d].  We  say  that 

h  =  So    F{y)  ^  =  So    (fa    f(X'  y)  ^  ^ 
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is  an  iterated  integral  of  /  over  R.  We  will  usually  write  it  as 

h  =  /    dy  f(x,y)dx. 

J  c  J  a 

Another  iterated  integral  can  be  defined  by  writing 


G(x)  =  I    f(x,y)dy,  a<x< 


and  denning 


b  I  rd 


h=  f  G(x)  dx  =  f  If 

Ja  Ja  \Jc 


f(x,  y)  dy  I  dx. 


which  we  usually  write  as 


Example  7.2.1  Let 


and#  =  [0,  1]  x  [1,2].  Then 


h  =  /    dx  f(x,y)dy. 

J  a  Jc 


f(x,y)  =  x  +  y 


F(  v)  =       f{x,y)dx  =      (x  +  y)dx  =        +  xy j 


x=0 


=  2+y 


and 


Also, 


=  2. 


'  •   ■  -  ^  (x  +  y  )dy=  (xy  + 


y=\ 


=  (2a  +  2)  -  j  x  +  l-  j  =  x  +  3-, 


and 


I2  =  fo1G(x)dx  =  fo1[x  +  l)  dx=(^-  +  3-fj 


=  2. 


In  this  example,  I\  =  1 2;  moreover,  on  setting  a  =  0,  b  =  I,  c  =  1 ,  and  d  =  2  in 
Example  7.1.1,  we  see  that 

/  (x  +  y)  d(x,  y)  =  2, 
Jr 

so  the  common  value  of  the  iterated  integrals  equals  the  multiple  integral.  The  following 
theorem  shows  that  this  is  not  an  accident. 
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Theorem  7.2.1  Suppose  that  f  is  integrable  on  R  =  [a,  b]  x  [c,  d]  and 

Hy)  =  f  f(x,y)dx 

J  a 

exists  for  each  y  in  [c,  d].  Then  F  is  integrable  on  [c,  d],  and 

J   F(y)dy  =  J  f(x,y)d(x,y);  (7.2.1) 

that  is, 

[  dy  [  f(x,y)dx=  f  f(x,y)d(x,y).  (7.2.2) 

Jc  Ja  JR 

Proof  Let 

Pi  :  a  =  xq  <  x\  <  ■  ■  ■  <  xr  =  b    and     P2  :  c  =  yo  <  y\  <  ■■■  <  ys  =  d 
be  partitions  of  [a,  b]  and  [c,  d],  and  P  =  P\  x  P2.  Suppose  that 

v/  i  <  'It  ■->■/■     !</<*,  (7-2.3) 

so 

s 

7  =  1 

is  a  typical  Riemann  sum  of  F  over  P2-  Since 

/>b  r  i>x 

F(jlj)  =  /    f(x,  m)dx  =  E  /      f(x'  ^.i)dx' 
J  a  ,=i"/-i 

(7.2.3)  implies  that  if 

ma  =  inf  {f(x,  y)  |        <  x  <  xt,  yy-i  <  y  <  yj} 

and 

Mij  =  sup  {f(x,  y)  |  xi-i  <  x  <  Xi,  yj-i  <  y  <  yy}  , 

then 

^  m,y(jC;  -  <  F(r]j)  <  ^  My  (x;  - 

;=1  !=1 

Multiplying  this  by  yy  —  J>y-i  and  summing  from  j  =  1  to  j  =  s  yields 

s      r  s 

J2J2mu(xi  -  *i-i)(yj  -  yj-0  <  J2  F^jKyj  -  yj-i) 
y=K=i  j=i 

s  r 
7=1 i=l 
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which,  from  (7.2.4),  can  be  rewritten  as 

sf(P)  <er<S/(P), 


(7.2.5) 


where  s /(P)  and  S/(P)  are  the  lower  and  upper  sums  of  /  over  P.  Now  let  sf(P2)  and 
Sf  (P2)  be  the  lower  and  upper  sums  of  F  over  P2;  since  they  are  respectively  the  infimum 
and  supremum  of  the  Riemann  sums  of  F  over  P2  (Theorem  3.1.4),  (7.2.5)  implies  that 


sf(P)  <  sF(P2)  <  SF(P2)  <  Sf(P). 


(7.2.6) 


Since  /  is  integrable  on  R,  there  is  for  each  e  >  0  a  partition  P  of  R  such  that  S/(P)  — 
s/(P)  <  e,  from  Theorem  7.1.12.  Consequently,  from  (7.2.6),  there  is  a  partition  P2  of 
[c,  d]  such  that  Sf(P2)  —  sf(P2)  <  e,so  F  is  integrable  on  [c,  d\,  from  Theorem  3.2.7. 

It  remains  to  verify  (7.2.1).  From  (7.2.4)  and  the  definition  of  F{y)  dy,  there  is  for 
each  e  >  0  a  S  >  0  such  that 


F(y)  dy  -  a 


<€    if  ||P2||<<$; 


that  is, 


€  < 


j    F{y)dy  <  a  +  e    if     ||P2||  <  $■ 
This  and  (7.2.5)  imply  that 

sf(P)-e<j^   F(y)dy  <  Sf(P)  +  e    if    ||P||  <  S, 


and  this  implies  that 


/  f(x,y)d(x,y) 
Jr 


<  ^  F(y)  dy  <  j  f{x,  y)  d(x,  y)  +  e 


(7.2.7) 


(Definition  7.1.4).  Since 


/  /(*,  y)  d(x ,  y)  =  /  f(x,y)d(x,y) 
Jr  Jr 


H 


(Theorem  7.1.8)  and  e  can  be  made  arbitrarily  small,  (7.2.7)  implies  (7.2.1). 

If  /  is  continuous  on  R,  then  /  satisfies  the  hypotheses  ofTheorem7. 2.1  (Exercise  7. 2. 3) 
so  (7.2.2)  is  valid  in  this  case. 

^ IRf{x,y)d{x,y)  and 


f{x,y)dy, 


a  <  x  < 
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exist,  then  by  interchanging  x  and  y  in  Theorem  7.2.1,  we  see  that 

pb  rd  p 

I   dx  I    f(x,y)dy=  /  fix,  y)  d(x,  y), 

Ja  Jc  JR 

This  and  (7.2.2)  yield  the  following  corollary  of  Theorem  7.2.1. 
Corollary  7.2.2  If  f  is  integrable  on  [a,  b]  x  [c,  d],  then 

t>b  i>d  t>d  r>b 

dx       f(x,y)dy=       dy  f(x,y)dx, 

Ja  Jc  Jc  J  a 

provided  that  f  (x,  y)  dy  exists  for  a  <  x  <  b  and  f  f  (x,  y)  dx  exists  fore  <  y  <  d. 
In  particular,  these  hypotheses  hold  if  f  is  continuous  on  [a,b]  x  [c,  d\. 

Example  7.2.2  The  function 

fix,  y)  =  x  +  y 

is  continuous  everywhere,  so  (7.2.2)  holds  for  every  rectangle  R.  For  example,  let  R  = 
[0,  1]  x  [1,2].  Then  (7.2.2)  yields 


(x  +  y)d(x,y) 


Ii dy  L  ^ + y)dx = Ii  i~2 +  xy) 


dy 


Since  /  also  satisfies  the  hypotheses  of  Theorem  7.2.1  with  x  and  y  interchanged,  we 
can  calculate  the  double  integral  from  the  iterated  integral  in  which  the  integrations  are 
performed  in  the  opposite  order;  thus, 


(x  +  y)d(x,y) 


=  j  dx  j  (x  +  y)dy  =  j  f.x 


y 


=  2. 


y=l 


dx 


A  plausible  partial  converse  of  Theorem  7.2.1  would  be  that  if  f  dy  f  f(x,  y)  dx 
exists  then  so  does  fR  f(x,  y)  d(x,  y);  however,  the  next  example  shows  that  this  need  not 
be  so. 

Example  7.2.3  If  /  is  defined  on  R  =  [0, 1]  x  [0, 1]  by 

f(x  y  )  =  \1Xy    if  y  iS  rationa1' 
\y       if  J  is  irrational, 
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then 

•  l 


/ 

Jo 


[  dy  f  f(x,y)dx=f  y  dy  =  -. 
Jo       Jo  Jo  1 


f(x,y)dx  =  y,    0  <  y  <  1, 

Jo 

and 


However,  /  is  not  integrable  on  R  (Exercise  7.2.7).  ■ 

The  next  theorem  generalizes  Theorem  7.2.1  to  R" . 

Theorem  7.2.3  Let  Ii,  I2,  ...  ,  In  be  closed  intervals  and  suppose  that  f  is  integrable 
on  R  =  1 1  x  I2  x  ■  •  ■  x  /„ .  Suppose  that  there  is  an  integer  p  in  {1,2, ...  ,n  —  1}  such  that 

Fp(Xp+i,Xp+2,  ...,Xn)  =  /  f(xi,x2,  ...,x„)d(xi,X2,  ...,*/>) 

J I\  XljX  —  Xlp 

exists  for  each  (xp+\ ,  xp+2,  ■  ■  ■  ,xn)  in  Ip+\  x  Ip+2  x  ■  •  ■  x  /„ .  Then 

J  Fp(Xp+l ,  Xp+2,  ■  ■  ■ ,  x„)  d(xp  +  \ ,  Xp+2,  ■  ■  ■ ,  x„) 

exists  and  equals  jR  f  (X)  dX.. 

Proof  For  convenience,  denote  (xp+\ ,  xp+2, . . . ,  xn)  by  Y.  Denote  R  =  I\  x  I2  x  ■  •  ■  x 
Ip  and  r  =  /p+i  x  7p+2  x  -  x/„.  LetP  =  R2, . . . ,  andQ  =  {Ti,  T2, . . . ,  rs} 
be  partitions  of  and  T,  respectively.  Then  the  collection  of  rectangles  of  the  form  R{  x  Tj 
(1  <  /  <  k,  1  <  j  <  s)  is  a  partition  P  of  i?;  moreover,  every  partition  P  of  is  of  this 
form. 

Suppose  that 

YyeTy.     l<y<j,  (7.2.8) 

so 

a=J2Fp(Yj)V(Tj)  (7.2.9) 
is  a  typical  Riemann  sum  of  Fp  over  Q.  Since 

Fp(Yj)  =  I  f  (xi,  x2,  ■  ■  ■ ,  x p,Y j)  d(xi,  x2, . . . ,  xp) 
Jr 

k 

=  T2  L  f(xi>x2,...,Xp,Yj)d(xi,x2,...,Xp), 
j  =  l  Jrj 

(7.2.8)  implies  that  if 

rriij  =  inf  |/(xi,x2, . . .  ,xp,  Y)  |  (xi,x2, . . . ,  xp)  e       Y  e 

and 

Mjy  =  sup  I  f{x\,  x2,...,  xp,  Y)  I  (x\,x2,...,xp)  e  Ri,Y  €  Tj 
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then 

k  k 

(=i  (=i 

Multiplying  this  by  V(Tj)  and  summing  from  j  =  1  to  j  =  s  yields 

s      k  s  s  k 

j2T,muv(RiW(Tj)  <  J2  FP(?j)v(Tj)  <  E  E  MijvfamTj), 

j=\i=i  j=\  y=i (=i 

which,  from  (7.2.9),  can  be  rewritten  as 

s/(P)  <  a  <  5/(P),  (7.2.10) 

where  s /  (P)  and  S /  (P)  are  the  lower  and  upper  sums  of  /  over  P.  Now  let  s  f  (Q)  and 
Sfp(Q)  be  the  lower  and  upper  sums  of  Fp  over  Q;  since  they  are  respectively  the  infimum 
and  supremum  of  the  Riemann  sums  of  Fp  over  Q  (Theorem  7.1.5),  (7.2.10)  implies  that 

Sf(F)  <  sFp(Q)  <  SFp(Q)  <  Sf(P).  (7.2.11) 

Since  /  is  integrable  on  R,  there  is  for  each  e  >  0  a  partition  P  of  R  such  that  5/(P)  — 
s /(P)  <  e,  from  Theorem  7.1.12.  Consequently,  from  (7.2.1 1),  there  is  a  partition  Q  of  T 
such  that  Sfp  (Q)  —  sfp  (Q)  <  e,  so  Fp  is  integrable  on  T ,  from  Theorem  7.1.12. 
It  remains  to  verify  that 

f  f(X)dX=  f  Fp(Y)d\.  (7.2.12) 
From  (7.2.9)  and  the  definition  of  fT  Fp(Y)  dY,  there  is  for  each  e  >  0  a  8  >  0  such  that 


I/, 


i>(Y)<iY-a 

r 


<  c   if   IIQH  <  5; 


that  is, 


a-e  <  j  Fp(Y)dY  <a +  e    if    ||Q||  <  <5. 


This  and  (7.2.10)  imply  that 

sf(P)-€<j  Fp(Y)dY  <  Sf(P)  +  e    if  ||P||<5, 
and  this  implies  that 

/  f(X)dX-e<  j  Fp(Y)dY<  I  f(X)dX  +  e.  (7.2.13) 
Jr  St  Jr 

since  /  f{X)dX=  /  f{X)dX  (Theorem  7.1.8)  and  €  can  be  made  arbitrarily  small, 
Jr  Jr 

(7.2.13)  implies  (7.2.12).  U 
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Theorem  7.2.4  Let  Ij  =  [aj,bj\,  1  <  j '  <  n,  and  suppose  that  f  is  integrable  on 
R  =  I \  x  I2  x  ■  •  ■  x  /„ .  Suppose  also  that  the  integrals 

Fp(xp+i,...,x„)  =  f(X)d(xi,x2,...,xp),     1  <  p  <n-\, 

J  I\xl2  —  *Ip 

exist  for  all 

(xp+i  ,xn)    in    Ip+i  x  •••  x  /„. 

Then  the  iterated  integral 

pb„  pb„-i  pb2  rb\ 

/     dxn  /       dx„-f  -  /     dx2  /  f(X)dxi 

Jan  Jan  —  i  Ja2 

exists  and  equals  jR  f  (X)  dX. 

Proof  The  proof  is  by  induction.  From  Theorem  7.2.1,  the  proposition  is  true  for  n  =  2. 
Now  assume  n  >  2  and  the  proposition  is  true  with  n  replaced  by  n  —  1 .  Holding  x„  fixed 
and  applying  this  assumption  yields 

fbn-l  fbn-2  fb2  fb\ 

Fn(xn)  =  /        dx„-i  /        dx„-2---       dx2  f(X)dxi. 

Jan-\  Ja„-2  Ja2  Jai 

Now  Theorem  7.2.3  with  p  =  n  —  1  completes  the  induction.  H 
Example  7.2.4  Let  R  =  [0, 1]  x  [1, 2]  x  [0, 1]  and 

f(x,y,z)  =  x  +  y  +  z. 

Then 


F\{y,Z.)  =  j  (x  +  y  +  z,)dx  =         +  xy  +  xz^ 
F^  =  l  F1(y,z)dy  =  ^  (l+y+z\dy 

2  \  |2 


1 

=  >  +y  +z, 

x=0  Z 


and 


J  fix,  y,  z)  d{x,  y,z)  =  J  F2(z)  dz  =  j  (2  +  z)  dz  =  (^2z  +  £\ 


1  _  5 
o  "  2' 


The  hypotheses  of  Theorems  7.2.3  and  7.2.4  are  stated  so  as  to  justify  successive  in- 
tegrations with  respect  to  X\,  then  x2,  then  X3,  and  so  forth.  It  is  legitimate  to  use  other 
orders  of  integration  if  the  hypotheses  are  adjusted  accordingly.  For  example,  suppose  that 
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12,  ...,/„}  is  a  permutation  of  {1, 2, . . . , «}  and  fR  f  (X)  dX  exists,  along  with 

f  f(X)d(xil,xi2,...,xij),     1  <  y  <  «  -  1,  (7.2.14) 

Jlix  Xli2X-Xlij 

for  each 

(xij+1,Xij+2,...,xin)    in       +  l  x  Iij+2x---x  Iin.  (7.2.15) 
Then,  by  renaming  the  variables,  we  infer  from  Theorem  7.2.4  that 

f(X)dX=         dxin  dxin_r--        dxh\      f(X)dxh.  (7.2.16) 

JR  Jain  •'"in-l  ^"'2  ^an 

Since  there  are  n  \  permutations  of  {1,2,. ..,«},  there  are  n\  ways  of  evaluating  a  mul- 
tiple integral  over  a  rectangle  in  W ,  provided  that  the  integrand  satisfies  appropriate  hy- 
potheses. In  particular,  if  /  is  continuous  on  R  and       i2,  ■ . . ,  in)  is  any  permutation  of 

{1, 2  ri),  then  /  is  continuous  with  respect  to  (xil ,  xt2, . . . ,  Xu)  on  Ijl  x  /,-2  x  ■  •  ■  x  /; . 

for  each  fixed  (x,- ,, ; ,  XjJ+2, . . . ,  Xjn)  satisfying  (7.2.15).  Therefore,  the  integrals  (7.2.14) 
exist  for  every  permutation  of  {1,2, ...  ,n}  (Theorem  7.1.13).  We  summarize  this  in  the 
next  theorem,  which  now  follows  from  Theorem  7.2.4. 

Theorem  7.2.5  Iff  is  continuous  on 

R  =  x  [a2,b2]  x  ■■■  x  [a„,bn], 

then  fR  f  (X)  dX  can  be  evaluated  by  iterated  integrals  in  any  of  the  n\  ways  indicated  in 
(7.2.16). 


Example  7.2.5  If  /  is  continuous  on  R  =  [a\,b\\  x  [a2,  b2]  x  [#3,  bj\,  then 

p^2  fb\ 


r  rt>3        ro2  r»\ 

I  f(x,y,z.)d(x,y,z.)  =  /     dz,  /     dy  I  f{x,y,z)dx 

J  R  Jai         J  a2         J  a\ 

ri>2  rbi  rb\ 

=  I     dy  I     dz.  I  f(x,y,z.)dx 

Ja2         Jai,  Jai 
fb3  rbi  rb2 

=  I     dz.  I     dx  I  f(x,y,z.)dy 

Jai         J a\  J  a2 

rbi         rbj,  p 

=  I     dx  I     dz  I 

Jai  Jai  Ja 


b\  fb^,  pb2 

f(x,y,z.)dy 

2 

fb2         rb\  />Z)3 

=  /     dy  I     dx  I  f(x,y,z)dz. 

Ja2  Ja\  J (I3 

pb\  pb2  nbj 

=  I     dx  I     dy  I  f(x,y,z)dz. 

Jai  J a2  Ja3 
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Integrals  over  More  General  Sets 

We  now  consider  the  problem  of  evaluating  multiple  integrals  over  more  general  sets.  First, 
suppose  that  /  is  integrable  on  a  set  of  the  form 

S  =  {(x,  y)  |  u(y)  <  x  <  v(y),  c  <  y  <  d\  (7.2.17) 

(Figure  7.2.1). 

If  u(y)  >  a  and  v(y)  <  b  for  c  <  y  <  d ,  and 

\f(x,y),  (x,y)&S, 
fs(x,y)=\  (7.2.18) 

then 

/  f(x,y)d(x,y)  =  /  fs(x,  y)  d(x,  y), 
Js  Jr 

where  R  =  [a,  b]  x  [c,  d]..  From  Theorem  7.2.1, 

/>  pd  i>b 

/  fs(x,y)d(x,y)  =       dy  fs(x,y)dx 
Jr  Jc  Ja 

provided  that  f  fs(x,  y)  dx  exists  for  each  y  in  [c,  d].  From  (7.2.17)  and  (7.2.18),  this 
integral  can  be  written  as 

rv(y) 

/       f(x,y)dx.  (7.2.19) 

Ju(y) 

Thus,  we  have  proved  the  following  theorem. 


y 

A 


Figure  7.2.1 

Theorem  7.2.6  If  f  is  integrable  on  the  set  S  in  (7.2.17)  and  the  integral  (7.2.19) 
exists  fore  <y<d,  then 

r  rd  pv(y) 

f(x,y)d(x,y)=       dy         f(x,y)dx.  (7.2.20) 

JS  Jc  Ju(y) 
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From  Theorem  7.1.22,  the  assumptions  of  Theorem  7.2.6  are  satisfied  if  /  is  continuous 
on  S  and  u  and  v  are  continuously  differentiable  on  [c,  d\. 

Interchanging  x  and  y  in  Theorem  7.2.6  shows  that  if  /  is  integrable  on 

5  =  {{x,  y)  |  u{x)  <y<  v(x),  a  <  x  <  b)  (7.2.21) 

(Figure  7.2.2)  and 

/•«(*) 

/  f{x,y)dy 

J  u{x) 

exists  for  a  <  x  <  b,  then 

f  f(x,y)d(x,y)=  f  dx  f  ()  f(x,y)dy.  (7.2.22) 

J  5  Ju(x) 


y  =  v(x) 


Figure  7.2.2 


Example  7.2.6  Suppose  that 


f(x,y)  =  xy 


and  S  is  the  region  bounded  by  the  curves  x  =  y2  and  x  =  y  (Figure  7.2.3).  Since  S  can 
be  represented  in  the  form  (7.2.17)  as 


(7.2.20)  yields 


S  =  {(x,  y)  |  y2  <  x  <  y,  0  <  y  <  1}  . 

/  xyd(x,y)=  I    dy  I    xy  dx, 
Js  Jo  Jy2 


which,  incidentally,  can  be  written  as 


/  xy  d(x,  y)  = 
Js 


f  ydy  ^ 

Jo  Jy2 


x  dx, 


Section  7.2  Iterated  Integrals  and  Multiple  Integrals  473 


since  y  is  independent  of  x.  Evaluating  the  iterated  integral  yields 


(x2 

VT 

:j 

/ 

y6 

4 

6 

1  //  _  y*\ 

2  V  4  6j 


24 


Figure  7.2.3 

In  this  case  we  can  also  represent  S  in  the  form  (7.2.21)  as 

S  =  {(x,  y)  |  x  <  y  <  V*,  0  <  x  <  l} ; 

hence,  from  (7.2.22), 


j^xy  d{x,y)  =  j  xdx  j     y  dy  =  j 


y=x  i 


x  dx 


-U'^-^-Kt-?) 

Example  7.2.7  To  evaluate 

./s 

S  =  {(x,y)  |-1<x<1,  0<y<l  +  \x\} 


1 

24' 


where 
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Figure  7.2.4 


we  invoke  Corollary  7.1.31  and  write 


where 


and 


(x  +  y)d(x,y)  =  (x  +  y)d(x,y)  +  (x  +  y)d(x,y), 
JS  JSi  Js2 


51  =  {(x,  y)\0<x  <l,  0<y  <l+x) 

52  =  {(x,  y)  |  -  1  <  x  <  0,  0  <  y  <  1  -  x} 


(Figure  7.2.5). 

From  Theorem  7.2.6, 


(x  +  y)d(x,y) 


/•l  rl+x 

I  dx  I  (x  +  y  )  dy  = 
Jo  Jo 

i  r1 

-  /    [(2x  +  l)2  -x2]dx 


/' 

Jo 


(x  +  yf 


l+x 


y=0 


"(2JC  +  1)3  X3 

5  3~ 


and 


(x  +  y)2 


/  (x  +  y)  d(x,  y)  =  I    dx  l      (x  +  y)  dy  =  l 
Js2  J-i      Jo  J-i 

If0         ,    ,       If      *3\  1°  1 
=  -/   (1  -  x2)  dx  =  -  [x  


l-x 


y=0 


Therefore, 


f  1  7 

/  (x  +  y)d(x,y)  =  2+  -  =  -. 
J  s 


3  3 
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y  =  1  +  x 


y=l-x 


Figure  7.2.5 

Example  7.2.8  To  find  the  area  A  of  the  region  bounded  by  the  curves 

y  =  x2  +  1    and    y  =  9  —  x2 

(Figure  7.2.6),  we  evaluate 


=  /  d(x, 
Js 


)■)■ 


where 


S  =  {(x,  y)  |  x2  +  1  <  y  <  9  -  x2 ,  -2  <  x  <  2} . 
According  to  Theorem  7.2.6, 

-2  r9-x2  r2 

-2 


dx  dy  =       [(9  -  x2)  -  (x2  +  1)]  dx 

-2        Jx2  +  1  J -2 


(8  -  2x2)  dx 


(  2x3 
(8X-- 


64 


Figure  7.2.6 
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Theorem  7.2.6  has  an  analog  for  n  >  2.  Suppose  that  /  is  integrable  on  a  set  5  of  points 
X  =  (x\,  Xi, . . . ,  x„)  satisfying  the  inequalities 

Uj(xj+i, . . .  ,x„)  <  xj  <  vj(xj+i,. .  .,x„),     1  <  j  <n-l, 

and 

a„  <  x„  <  b„. 

Then,  under  appropriate  additional  assumptions,  it  can  be  shown  by  an  argument  analogous 
to  the  one  that  led  to  Theorem  7.2.6  that 

f  pbn  pv„(x„)  nv2(x3,...,x„)  pVl(x2,—,Xn) 

I  f(X)dX=  I     dx„  I         dx„-i--  -  I  dxj  I  f(X)dxi. 

JS  Ja„  Ju„(x„)  JU2(x3,...,x„)  Ju\(x2,---,x„) 

These  additional  assumptions  are  tedious  to  state  for  general  n.  The  following  theorem 
contains  a  complete  statement  for  n  =  3. 

Theorem  7.2.7  Suppose  that  f  is  integrable  on 

S  =  {(x,  y,  z)  |  ui(y,  z)  <  x  <  v\(y,  z),  u2(z)  <  y  <  v2(z),  c  <  z.  <  d) , 
and  let 

S(z)  =  {(x,y)  |  Ui(y,z)  <  x  <  vi(y,z),  u2{z)  <  y  <  v2(z)} 
for  each  z.  in  \c,d\.  Then 

/  f(x,y,z.)  d(x,y,z.)  =       dz.  dy  f(x,y,z)dx, 

Js  Jc  J«2(z)  Ju\(y,z) 

provided  that 

/  f(x,y,z.)dx 

Jui(y,z) 

exists  for  all  {y,  z.)  such  that 

c  <  z.  <  d    and    u2(z.)  <  y  <  v2(z), 

and 

/     f{x,  y,z)d{x,y) 

JS(z) 

exists  for  all  z.  in  [c,  d]. 

Example  7.2.9  Suppose  that  /  is  continuous  on  the  region  S  in  R3  bounded  by  the 
coordinate  planes  and  the  plane 

x  +  y  +  2z.  =  2 

(Figure  7.2.7);  thus, 


Section  7.2  Iterated  Integrals  and  Multiple  Integrals  411 

z 
k. 


S  =  {(x,  y,z.)\0  <  x  <2- y  -2z,  0  <  y  <  2  -  2z,  0  <  z  <  1} . 
From  Theorem  7.2.7, 

/.  pi  p2-2z  p2-y-2z 

/  f(x,y,z)d(x,y,z)  =       dz  dy  f(x,y,z)dx. 

Js  Jo       Jo  Jo 

There  are  five  other  iterated  integrals  that  equal  the  multiple  integral.  We  leave  it  to  you 
to  verify  that 

r  r2         rl-y/2  n2-y-2z 

/  f(x,y,z)  d(x,y,z)  =      dy  dz  f(x,y,z)dx 

Js  Jo       Jo  Jo 

nl         p2-2z  p2-x-2z 

=      dz.  dx  f(x,y,z)dy 

Jo       Jo  Jo 

/>2  pl-x/2  p2-x-2z 

=  I   dx  I        dz.  f(x,y,z)dy 
Jo       Jo  Jo 

p2         p2-x  /•l—x/2—y/2 

=  I    dx  I       dy  I  f(x,y,z)dz 
Jo       Jo  Jo 

j-2         *2-y  /.l-ar/2-y/2 

=  j    dy  I       dx  I  f(x,y,z)dz  ■ 

Jo       Jo  Jo 

(Exercise  7.2.15). 

Thus  far  we  have  viewed  the  iterated  integral  as  a  tool  for  evaluating  multiple  integrals. 
In  some  problems  the  iterated  integral  is  itself  the  object  of  interest.  In  this  case  a  result 
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like  Theorem  7.2.6  can  be  used  to  evaluate  the  iterated  integral.  The  procedure  is  as  follows. 

(a)  Express  the  given  iterated  integral  as  a  multiple  integral,  and  check  to  see  that  the 
multiple  integral  exists. 

(b)  Look  for  another  iterated  integral  that  equals  the  multiple  integral  and  is  easier  to 
evaluate  than  the  given  one.  The  two  iterated  integrals  must  be  equal,  by  Theo- 
rem 7.2.6. 

This  procedure  is  called  changing  the  order  of  integration  of  an  iterated  integral. 


Example  7.2.10  The  iterated  integral 

I  =  f  dy  f 
Jo  Jo 


e-^dx 


is  hard  to  evaluate  because  e  ^x  ^2  has  no  elementary  antiderivative.  The  set  of  points 
(x,  y)  that  enter  into  the  integration,  which  we  call  the  region  of  integration,  is 


(Figure  7.2.8). 


S  =  {(x.y)  |0  <  x  <  y,  0  <  y  <  l} 


Figure  7.2.8 


Therefore, 


/  = 


d(x,y), 


(7.2.23) 


and  this  multiple  integral  exists  because  its  integrand  is  continuous.  Since  S  can  also  be 
written  as 

S  =  {(x,  y)  |  x  <  y  <  1,  0  <  x  <  1} , 
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Theorem  7.2.6  implies  that 

[  e-{x~l)1  d(x,y)=  f  e-{x~l)2dx  C  dy  =  -  j\x  -  \)e-{x-l)2 dx 

Js  J0  Jx  JO 


1  1 
o  L 


This  and  (7.2.23)  imply  that 

/-io-.-). 

Example  7.2.11  Suppose  that  /  is  continuous  on  [a,  oo)  and  y  satisfies  the  differen- 
tial equation 

y"(x)  =  f(x),    x>a,  (7.2.24) 

with  initial  conditions 

y(a)  =  y'{a)  =  0. 

Integrating  (7.2.24)  yields 

/oo = r  nodi, 

J  a 

since  y'{a)  =  0.  Integrating  this  yields 

y(x)  =  f  ds  f  f{t)dt, 

J  a         J  a 

since  y(a)  =  0.  This  can  be  reduced  to  a  single  integral  as  follows.  Since  the  function 

g(s,t)=f(t) 

is  continuous  for  all  (s,  t)  such  that  t  >  a,  g  is  integrable  on 

S  =  {(s,  t)  |  a  <  t  <  s,  a  <  s  <  x) 
(Figure  7.2.9),  and  Theorem  7.2.6  implies  that 

(  f{t)d(s,t)=  f  ds  f  f{t)dt  =  y(x).  (7.2.25) 

J  S  J  a         J  a 

However,  5  can  also  be  described  as 

S  =  {(s.  t)  1 1  <  s  <  x,  a  <  t  <  x) 
so  Theorem  7.2.6  implies  that 

/>  1>X  l>X  PX 

\  f(t)d(s,t)=  /    f(t)dt       ds  =  (x-t)f(t)dt. 

JS  Ja  Jt  Ja 

Comparing  this  with  (7.2.25)  yields 

y(x)  =  [  {x-t)f{t)dt. 

J  a 
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Figure  7.2.9 


7.2  Exercises 


1.  Evaluate 

(a)/.V-', 


(b)  j  dx  J  (x3  +  y4)  dy 

log2  rl 


(x  +  3y)  dx 


/•23T  p2  /.log  2  />1  2 

(c)  /     x  dx  I    sin  xy  dy  (d)  /       y  dy  I    xex  y  dx 

Jtt/2  J\  JO  JO 

2.    Let  I  j  =  [cij,bj],  1  <  j  <  3,  and  suppose  that  /  is  integrable  onff  =  /ix/2x  73. 
Prove: 

(a)   If  the  integral 

G{y,z)  =  /  f(x,y,z)dx 

J  a\ 

exists  for  (j,  z.)  €  h  x  ^3,  then  G  is  integrable  on  I2  x  ^3  and 
/  f(x,y,z)d(x,y,z)=  G(y,z)d(y,z). 

JR  JI2XI3 


(b)   If  the  integral 


ff(z)  =  /  f(x,y,z)d(x,y) 
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exists  for  z,  €  h,  then  H  is  integrable  on  I3  and 


f  f(x,y,z)d(x,y,z)=  [  *  H(z)dz. 

JR  Ja3 


HINT:  For  both  parts,  see  the  proof  of  Theorem  7.2.1. 
3.     Prove:  If  /  is  continuous  on  [a,  b]  x  [c,  d],  then  the  function 

fb 


F(y)  =  f  f(x,y)dx 

J  a 


is  continuous  on  [c,  d].  Hint:  Use  Theorem  5.2.14. 

4.  Suppose  that 

/ (x',  y')  >  f(x,y)    if    a  <  x  <  x'  <  b,  c  <  y  <  y'  <  d. 

Show  that  /  satisfies  the  hypotheses  of  Theorem  7.2.1  on  R  =  [a,  b]  x  [c,  d].  Hint: 
See  the  proof  of  Theorem  3.2.9. 

5.  Evaluate  by  means  of  iterated  integrals: 


(a)  [  (xy+  l)d(x,y);    R  =  [0, 1]  x  [1, 2] 

JR 

(b)  [  (2x +3y)d(x,y);    R  =  [1, 3]  x  [1, 2] 


(c)  f    -jL=^(x,j);    2?  =  [0, 1]  x  [0, 1] 

(d)  x  cos  x_y  cos  2ttx  d(x,  y);    R  =  [0,  ^]  x  [0, 2n] 

6.    Let  ,4  be  the  set  of  points  of  the  form  (2~m  p,  2~mq),  where  p  and  q  are  odd  integers 


and  m  is  a  nonnegative  integer.  Let 

f(x,y)  = 


l0,  (x,j)e^. 


Show  that  /  is  not  integrable  on  any  rectangle  R  =  [a,  b]  x  [c ,  d],  but 

rd  rd  rb 


I    dx  I    f(x,y)dy  =  dy 

J  a  J  c  J  c  J  a 


f(x,y)dx  =  (b -a)(d -c).  (A) 


HINT:  For  (A),  use  Theorem  3.5.6  and  Exercise  3.5.6. 
7.  Let 

ftx  y\  =  [2xy       if  y  is  rational, 
\y  if  _y  is  irrational, 

and  R  =  [0,  1]  x  [0,  1]  (Example  7.2.3). 
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(a)  Calculate  fR  f(x,  y)  d(x,  y)  and  Jo  fix,  y)  d(x,  y),  and  show  that  /  is  not 
integrable  on  R. 

(b)  Calculate  (j^f '(x ,  y)  dy^j  dx  and  JQl  yf^f(x,y)dy^j  dx. 
8.     Let  R  =  [0,  1]  x  [0,  1]  x  [0,  1],  R  =  [0,  1]  x  [0,  1],  and 


f(x,y,z)  = 


\2xy  +  2xz.  if  y  and  z  are  rational, 

!  y  +  2xz  if  y  is  irrational  and  z.  is  rational, 

I  2xy  +  z  if  y  is  rational  and  z  is  irrational, 

I  y  +  z.  if  y  and  z,  are  irrational. 


Calculate 

(a)  /  fix,  y,z)d(x,  y,z)  and  /  fix,  y,z)dix,  y,z) 

JR  JR 

(b)  L  fix,y,z)  dix,y)  and  \fix,y,z)dix,y) 

JR  JR 

/>  1  /"I  /"I  /"I  /»  1 

(c)  I    dy  I    fix,  y,  z)  dx  and  /    rfz  /    g?>>  /  fix,y,z)dx. 
Jo       Jo  «/  o       «/  o  ./o 

9.     Suppose  that  /  is  bounded  on  R  =  [a,  b]  x  [c,  d].  Prove: 

(a)  j  fix,y)dix,y)<  j  fix,y)dyj  dx.  Hint:  Use  Exercise  3.2.6(a). 

(b)  j  fix,y)dix,y)>  j  fix,  y)  dy\  dx.  Hint:  UseExercise  3.2.6(b). 

10.  Use  Exercise  7.2.9  to  prove  the  following  generalization  of  Theorem  7.2.1:  If  /  is 
integrable  on  R  =  [a,  b]  x  [c,  d],  then 

/  fix,y)dy    and     /  f(x,y)dy 

J  a  J  c 

are  integrable  on  [a,  b],  and 
j  ^Jfix,y)dyj  dx  =  j  ^  j  fjx,  y)  dy^  dx  =  jR  f(x<  di 

1 1 .  Evaluate 

(a)  f  ix-2y +  3z)dix,y,z);    R  =  [-2,0]  x  [2,5]  x  [-3,2] 
Jr 

(b)  I  e~x2~y2  smxsmzdix,y,z);    R  =  [-1, 1]  x  [0,2]  x  [0,  tt/2] 

(c)  f  (jcj  +  2xz  +  yz)  dix,  y,z);    R  =  [— 1, 1]  x  [0, 1]  x  [-1, 1] 
Jr 


x,y). 
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(d)    [  x2y3zexy2z2d(x,y,z);    R  =  [0, 1]  x  [0, 1]  x  [0, 1] 

JR 

12.  Evaluate 

(a)  J  (2x  +  y2)d(x,  y);    S  =  {(x,  y)  |  0  <  x  <  9  -  y2,  -3  <  y  <  3} 

(b)  /  2xy  d(x,  y);    S  is  bounded  by  y  =  x2  and  x  =  y2 
Js 

(c)  /  e1— S  =  {(x,y)\logy  <x  <log2y,  7t/2<y  <7t} 

Js  y 

13.  Evaluate  fs(x  +  y)d(x,y),  where  S  is  bounded  by  y  =  x2  and  y  =  2x,  using 
iterated  integrals  of  both  possible  types. 

14.  Find  the  area  of  the  set  bounded  by  the  given  curves. 

(a)  y  =  x2  +  9,  y  =  x2  -  9,  x  =  -1,  x  =  1 

(b)  y  =  x  +  2,  y  =  4  -  x,  x  =  0 

(c)  x  =  y2  —  4,  x  =  4  —  y2 

(d)  y  =  e2x,  y  =  —2x,  x  =  3 

15.  In  Example  7.2.9,  verify  the  last  five  representations  of  fs  f(x,  y,  z)  d(x,  y,z)  as 
iterated  integrals. 

16.  Let  S  be  the  region  in  K3  bounded  by  the  coordinate  planes  and  the  plane  x  + 
2y  +  3z  =  1.  Let  /  be  continuous  on  5.  Set  up  six  iterated  integrals  that  equal 
fs  f(x,y,z)d(x,y,z). 

17.  Evaluate 

(a)  /  x  d(x,  y,z);    S  is  bounded  by  the  coordinate  planes  and  the  plane 
Js 

3x  +  y  +  z.  =  2. 

(b)  J  yez  d(x,  y,  z);  5  =  {(x,  y,  z)  \  0  <  x  <  I,  0  <  y  <  V*,  0  <  z  <  y2} 

(c)  /  xyzd(x,y,z); 
Js 

S  =  Ux,  y,z)\0<y<  l,0<x<  yjl-y2,  0  <  z  <  yjx2  +  y2| 

(d)  J  yz,  d(x,  y,  z);  S  =  {(x,  y,  z)  |  z2  <  x  <  sfz,  0  <  y  <  z,  0  <  z  <  l} 

18.  Find  the  volume  of  S. 

(a)  S  is  bounded  by  the  surfaces  z.  =  x2  +  y2  and  z  =  8  —  x2  —  y2. 

(b)  5  =  {(x,  y,  z)  |  0  <  z  <  x2  +  y2,  (x,  y,  0)  is  in  the  triangle  with  vertices 
(0,1,0),  (0,0,0),  and  (1,0,0)} 

(c)  S  =  {(x,  y,  z)  |  0  <  y  <  x2,  0  <  x  <  2,  0  <  z  <  y2} 

(d)  5  =  {(jc,  y,z.)\x>0,  y  >  0,  0  <  z  <  4  -  Ax2  -  Ay2} 
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19.  Let  R  =  [m,  bj\  x  [a2,  £2]  x  ■  ■  ■  x  [«n,  Evaluate 

(a)  /R  (xi  +  x2  +  •••  +  xn)  dX  (b)  /g  (x\  +  x\  +  •••  +  x2)  dX 

(c)  /R  X1X2,  ■■■XndX 

20.  Assuming  that  /  is  continuous,  express 


/•1  r^jT-y2 
/    <fy  /    ,  f(x,y)dx 

Jl/2  J-^/l-y2 


as  an  iterated  integral  with  the  order  of  integration  reversed. 

21.     Evaluate  fs(x  +  y)  d(x,  y)  of  Example  7.2.7  by  means  of  iterated  integrals  in  which 
the  first  integration  is  with  respect  to  x. 


22.  Evaluate 


/    x  dx  I 

Jo  Jo 


y/x2  +  y2 

23.  Suppose  that  /  is  continuous  on  [a,  00), 

yw(jc)  =  /(*),  t>a, 

andj(a)  =  /(a)  =  •••  =  y^'^ia)  =  0. 

(a)  Integrate  repeatedly  to  show  that 

y(x)=       dtH  /    dt„-f-       dt2       f{h)dh.  (A) 

(b)  By  successive  reversals  of  orders  of  integration  as  in  Example  7.2.1 1,  deduce 
from  (A)  that 

yO)  =  7— ^  l\x-t)n-lf(t)dt. 

24.  Let  Tp  =  [0,  p]  x  [0,  p],  p  >  0.  By  calculating 

1(a)  =  lim  /    e~xy  sinax  d(x,  y) 
in  two  different  ways,  show  that 


Jo 


sin  ax  ji 

 ax  =  —    if    a  >  0. 

x  2 


7.3  CHANGE  OF  VARIABLES  IN  MULTIPLE  INTEGRALS 


In  Section  3.3  we  saw  that  a  change  of  variables  may  simplify  the  evaluation  of  an  ordinary 
integral.  We  now  consider  change  of  variables  in  multiple  integrals. 
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Prior  to  formulating  the  rule  for  change  of  variables,  we  must  deal  with  some  rather 
involved  preliminary  considerations. 

Jordan  Measurable  Sets 
In  Section  we  defined  the  content  of  a  set  S  to  be 


if  the  integral  exists.  If  R  is  a  rectangle  containing  S,  then  (7.3.1)  can  be  rewritten  as 


From  Exercise  7.1.27,  the  existence  and  value  of  V(S)  do  not  depend  on  the  particular 
choice  of  the  enclosing  rectangle  R.  We  say  that  S  is  Jordan  measurable  if  V(S)  exists. 
Then  V(S)  is  the  Jordan  content  of  S. 

We  leave  it  to  you  (Exercise  7.3.2)  to  show  that  S  has  zero  content  according  to  Defini- 
tion 7.1.14  if  and  only  if  5  has  Jordan  content  zero. 

Theorem  7.3.1  A  bounded  set  S  is  Jordan  measurable  if  and  only  if  the  boundary  of 
S  has  zero  content. 

Proof  Let  R  be  a  rectangle  containing  5.  Suppose  that  V(dS)  =  0.  Since  ijrs  is 
bounded  on  R  and  discontinuous  only  on  dS  (Exercise  2.2.9),  Theorem  7.1.19  implies  that 
fR  Vfs(X)  dX  exists.  For  the  converse,  suppose  that  dS  does  not  have  zero  content  and 
let  P  =  {R\,  R2,  ■  ■  ■ ,  Rk}  be  a  partition  of  R.  For  each  j  in  {1,2,...,  k}  there  are  three 
possibilities: 

1.  Rj  C  S;  then 


min  {fsQQ  I X  €  Rj}  =  max  {^(X)  |  X  €  Rj}  =  1. 

2.  Rj  n  S  ^  0  and  Rj  n  Sc  ^  0;  then 

min{^s(X)  |X  €  Rj}  =  0    and    max  {fsQQ  |  X  e  Rj}  =  1. 

3.  Rj  C  Se;then 

min^slX)  |X  €  Rj}  =  max  {^s(X)  |  X  e  fly}  =  0. 


(7.3.1) 


where  1]/$  is  the  characteristic  function  of  S,  defined  by 


1,  XeS, 
0,  X^S. 
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Let 

Hi  =  {j  |  Rj  C  S}    and     U2  =  {7  |  Rj  n  5  ^  0  and      n  Se  7^  0} .  (7.3.2) 
Then  the  upper  and  lower  sums  of  xfrs  over  P  are 

scp)  =  J2  v(rj)  +  E  vw> 

yell]  y6ll2  (7.3.3) 

=  total  content  of  the  subrectangles  in  P  that  intersect  S 

and 

s{P)  =  Y  V(Rj) 

yell,  (7.3.4) 
=  total  content  of  the  subrectangles  in  P  contained  in  S . 

Therefore, 

S(P)-s(P)=  v(Rj), 

which  is  the  total  content  of  the  subrectangles  in  P  that  intersect  both  S  and  Sc .  Since 
these  subrectangles  contain  3S,  which  does  not  have  zero  content,  there  is  an  €0  >  0  such 
that 

S(P)-s(P)  >e0 

for  every  partition  P  of  R.  By  Theorem  7.1.12,  this  implies  that  1/^5  is  not  integrable  on  R, 
so  S  is  not  Jordan  measurable.  TJ 

Theorems  7.1.19  and  7.3.1  imply  the  following  corollary. 

Corollary  7.3.2  If  f  is  bounded  and  continuous  on  a  bounded  Jordan  measurable  set 
S,  then  f  is  integrable  on  S. 

Lemma  7.3.3  Suppose  that  K  is  a  bounded  set  with  zero  content  and  e,  p  >  0.  Then 
there  are  cubes  C\,  Cj  Cr  with  edge  lengths  <  p  such  that  Cj  fl  K  ^  0,  1  <  j  <  r, 

r 

Kd\J  Cj,  (7.3.5) 

.7=1 


and 


Proof   Since  V(K)  =  0, 


£>(C;)<e. 


if  C  is  any  cube  containing  K.  From  this  and  the  definition  of  the  integral,  there  is  a  S  >  0 
such  that  if  P  is  any  partition  of  C  with  \\P\\  <  S  and  a  is  any  Riemann  sum  of  t^k  over 
P,  then 

0  <  a  <  e.  (7.3.6) 
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Now  suppose  that  P  =  {C\ ,  Ci, . . . ,  C^}  is  a  partition  of  C  into  cubes  with 

||P ||  <mm(p,8),  (7.3.7) 

and  let  C1}  C2  C*  be  numbered  so  that  Cy  n  K  ^  0  if  1  <  j  <  r  and  Cj  Ci  K  =  & 

if  r  +  1  <  /  <  k.  Then  (7.3.5)  holds,  and  a  typical  Riemann  sum  of  i/>^  over  P  is  of  the 
form 

r 

7=1 

with  Xj  €  Cj,  I  <  j  <  r.  In  particular,  we  can  choose  Xy  from  AT,  so  that  i/>jf  (Xy)  =  1, 
and 

r 

a  =  J]  F(Cy). 
7  =  1 

Now  (7.3.6)  and  (7.3.7)  imply  that  C\ ,  C2,  . . . ,  Cr  have  the  required  properties.  H 
Transformations  of  Jordan-Measurable  Sets 

To  formulate  the  theorem  on  change  of  variables  in  multiple  integrals,  we  must  first  con- 
sider the  question  of  preservation  of  Jordan  measurability  under  a  regular  transformation. 

Lemma  7.3.4  Suppose  that  G  :  W  — >  W  is  continuously  differentiable  on  a  bounded 
open  set  S,  and  let  K  be  a  closed  subset  of  S  with  zero  content.  Then  G(K)  has  zero 
content. 

Proof  Since  K  is  a  compact  subset  of  the  open  set  S,  there  is  a  p\  >  0  such  that  the 
compact  set 

KPI  =  {X|  dist(X,/0  <Pl} 
is  contained  in  S  (Exercise  5.1.26).  From  Lemma  6.2.7,  there  is  a  constant  M  such  that 

|G(Y)-G(X)|  <  M|Y-X|    if   X,Y  e  KP1.  (7.3.8) 

Now  suppose  that  e  >  0.  Since  V(K)  =  0,  there  are  cubes  C\,  C2,  Cr  with  edge 
lengths  s\,  Sz, Sf  <  P\l »Jn  such  that  Cy  n  K  7^  0,  1  <  j  <  r, 

r 

Kc[jCj, 
7  =  1 

and 

r 

V(Cj)  <  e  (7.3.9) 

7  =  1 

(Lemma  7.3.3).  For  1  <  j  <  r,  let  Xy  €  Cy  D  K.  If  X  e  Cy,  then 

|X-Xy|  <  Jy  V«  <  (01, 
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soX  €  K  and  |G(X)-G(Xy)|  <  M|X-Xy  |  <  M-Jrisj,  from (7.3.8).  Therefore,  G(Cj) 
is  contained  in  a  cube  C  j  with  edge  length  2M      s  / ,  centered  at  G(X , ) .  Since 

v(Cj)  =  (2Af  =  (2Mjn)nV{Cj), 

we  now  see  that 

r 

G(K)  c  |J  Cj 

.7=1 

and 

r  r 

where  the  last  inequality  follows  from  (7.3.9).  Since  (2M «Jn)n  does  not  depend  on  e,  it 
follows  that  V(G(K))  =  0.  U 

Theorem  7.3.5  Suppose  that  G  :  W  — >  W  is  regular  on  a  compact  Jordan  measur- 
able set  S.  Then  G(S)  is  compact  and  Jordan  measurable. 

Proof  We  leave  it  to  you  to  prove  that  G(S)  is  compact  (Exercise  6.2.23).  Since  S 
is  Jordan  measurable,  V(dS)  =  0,  by  Theorem  7.3.1.  Therefore,  V(G(dS))  =  0,  by 
Lemma  7.3.4.  But  G(9S)  =  3(G(5))  (Exercise  6.3.23),  so  V(d(G(S)))  =  0,  which 
implies  that  G(5)  is  Jordan  measurable,  again  by  Theorem  7.3.1.  H 

Change  of  Content  Under  a  Linear  Transformation 

To  motivate  and  prove  the  rule  for  change  of  variables  in  multiple  integrals,  we  must  know 
how  V(L,(S))  is  related  to  V(S)  if  S  is  a  compact  Jordan  measurable  set  and  L  is  a  nonsin- 
gular  linear  transformation.  (From  Theorem  7.3.5,  L(S)  is  compact  and  Jordan  measurable 
in  this  case.)  The  next  lemma  from  linear  algebra  will  help  to  establish  this  relationship. 
We  omit  the  proof. 

Lemma  7.3.6  A  nonsingular  n  x  n  matrix  A.  can  be  written  as 

A  =  EfcE*_1  ...Elt  (7.3.10) 

where  each  E,-  is  a  matrix  that  can  be  obtained  from  the  n  x  n  identity  matrix  I  by  one  of 
the  following  operations: 

(a)  interchanging  two  rows  of  I: 

(b)  multiplying  a  row  of  I  by  a  nonzero  constant; 

(c)  adding  a  multiple  of  one  row  of  I  to  another. 

Matrices  of  the  kind  described  in  this  lemma  are  called  elementary  matrices.  The  key  to 
the  proof  of  the  lemma  is  that  if  E  is  an  elementary  n  x  n  matrix  and  A  is  any  n  x  n  matrix, 
then  EA  is  the  matrix  obtained  by  applying  to  A  the  same  operation  that  must  be  applied 
to  I  to  produce  E  (Exercise  7.3.6).  Also,  the  inverse  of  an  elementary  matrix  of  type  (a), 
(b),  or  (c)  is  an  elementary  matrix  of  the  same  type  (Exercise  7.3.7). 

The  next  example  illustrates  the  procedure  for  finding  the  factorization  (7.3.10). 
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Example  7.3.1  The  matrix 


0  1  1 

1  0  1 

2  2  0 


is  nonsingular,  since  det(A)  =  4.  Interchanging  the  first  two  rows  of  A  yields 


Ai  = 


1  0  1 
0    1  1 

2  2  0 


=  EiA, 


where 


Ei  = 


0  1  0 

1  0  0 
0    0  1 


Subtracting  twice  the  first  row  of  Ai  from  the  third  yields 

= E2EiA, 


1  0  1 
0  1  1 
0  2-2 


where 


E? 


1    0  0 
0    1  0 
-2    0  1 


Subtracting  twice  the  second  row  of  A2  from  the  third  yields 


1  0  1 
0  1  1 
0  0-4 


E3E2E1  A, 


where 


Multiplying  the  third  row  of  A3  by  —  j  yields 


A4  = 


1  0  1 
0  1  1 
0    0  1 


0  0 

1  0 
-2  1 


=  E4A3E2E1A, 


where 


E4  = 


1  0 
0  1 


0 
0 


0   0   -I  j 
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Subtracting  the  third  row  of  A4  from  the  first  yields 


1  0  0 
0  1  1 
0    0  1 


=  E5A4E3E2E1A, 


where 


1    0  -1 
E5  =      0    1  0 
0    0  1 

Finally,  subtracting  the  third  row  of  A5  from  the  second  yields 

I  =  E6E5E4E3E2E1  A, 

where 

E6  = 


1  0  0 
0  1  -1 
0    0  1 


From  (7.3.11)  and  Theorem  6.1.16, 

A  =  (E6E5E4E3E2E1 )  ^  =  Ej  ^^>2  ^^3       ^E^  ^Eg 

Therefore, 


A  —  E6E5E4E3E2E1 , 


where 


1 

0 

0  " 

1 

0 

1 

Ei 

=  %J  = 

0 

1 

1 

,  E2 

=  E-i  = 

0 

1 

0 

0 

0 

1 

0 

0 

1 

"  1 

0 

0  " 

"  1 

0 

0 

E3 

=  A41  = 

0 

1 

0 

,  E4 

=  %1  = 

0 

1 

0 

0 

0 

-4 

0 

2 

1 

"  1 

0 

0  " 

"  0 

1 

0 

E5 

=  E2-1  = 

0 

1 

0 

,  E6 

=  Eri  = 

1 

0 

0 

2 

0 

1 

0 

0 

1 

(7.3.11) 


(Exercise  7.3.7(c)).  ■ 

Lemma  7.3.6  and  Theorem  6.1.7(c)  imply  that  an  arbitrary  invertible  linear  transforma- 
tion L  :  W  -+  R",  defined  by 

X  =  L(Y)  =  AY,  (7.3.12) 

can  be  written  as  a  composition 


where 


L  =  L/t  o  L/t_!  o  •  •  •  o  Li, 
L;(Y)=E;Y,  \<i<k. 


(7.3.13) 
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Theorem  7.3.7  If  S  is  a  compact  Jordan  measurable  subset  ofW  and  L  :  W  — ►  W 
is  the  invertible  linear  transformation's.  =  L(Y)  =  AY,  then 

7(L(5))  =  |det(A)|  7(5).  (7.3.14) 

Proof   Theorem  7.3.5  implies  that  L(S)  is  Jordan  measurable.  If 

V(L(R))  =  |det(A)|7(i?)  (7.3.15) 

whenever  R  is  a  rectangle,  then  (7.3.14)  holds  if  S  is  any  compact  Jordan  measurable 
set.  To  see  this,  suppose  that  e  >  0,  let  R  be  a  rectangle  containing  S,  and  let  P  = 
{Ri,  i?2,  •  •  • ,  Rk)  be  a  partition  of  R  such  that  the  upper  and  lower  sums  of  \j/s  over  P 
satisfy  the  inequality 

S(P)  -s{P)  <e.  (7.3.16) 
Let  Ki  and  U2  be  as  in  (7.3.2).  From  (7.3.3)  and  (7.3.4), 

S(P)  =  j2  v(Rj)  <  7(5)  <    *w  +  E      = s^)-  (73-17> 

y'eWi  ./eK,  jeU2 

Theorem  7.3.7  implies  that  L(i?i),  L(i?2),  •  •  • ,  L(i?fc)  and  L(S)  are  all  Jordan  measurable. 
Since 

U  Rj  CSC  |J 

it  follows  that 

L I  U  rj  ) c L<5> cL\  u  ^ ) • 

\./€lti       /  \jeSiUS2  J 

Since  L  is  one-to-one  on  R" ,  this  implies  that 

J2  V(L(Rj))  <  7(L(5))  <  £  7(L(*y))  +  £  7(L(*;)).  (7.3.18) 

If  we  assume  that  (7.3.15)  holds  whenever     is  a  rectangle,  then 

V(L(Rj))  =  |det(A)|7(#7),     1  <j  <  k, 

so  (7.3.18)  implies  that 

7(L(5)) 
I  det(A)  | 

This,  (7.3.16)  and  (7.3.17)  imply  that 

V(L(S)) 


7(5)- 


det(A)| 


<  e; 


hence,  since  e  can  be  made  arbitrarily  small,  (7.3.14)  follows  for  any  Jordan  measurable 
set. 
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To  complete  the  proof,  we  must  verify  (7.3.15)  for  every  rectangle 

R  =  [ai,bi]  x  [a2,b2]  x  •••  x  [a„,bn]  =  Ii  x  I2  x  •••  x  /„. 
Suppose  that  A  in  (7.3.12)  is  an  elementary  matrix;  that  is,  let 

X  =  L(Y)  =  EY. 

Case  1 .  If  E  is  obtained  by  interchanging  the  i  th  and  j  th  rows  of  I,  then 

{yr    if  r  7^  i  and  r  ^  j; 
yj    ifr  =  i; 
yi     if  r  =  j . 

Then  h(R)  is  the  Cartesian  product  of  I\,  I2, . . .,  I„  with  7,  and  Ij  interchanged,  so 

V(L(R))  =  V(R)  =  \det(E)\V(R) 

since  det(E)  =  —1  in  this  case  (Exercise  7.3.7(a)). 

CASE  2.  If  E  is  obtained  by  multiplying  the  rth  row  of  I  by  a,  then 


xr  = 


yr  if  r  j=  i , 
ayi    if  r  =  i . 


Then 

L,(R)  =  Ii  x  •  •  •  x        x  I-  x        x  •  •  •  x  /„, 
where  //  is  an  interval  with  length  equal  to  \a  \  times  the  length  of  7, ,  so 

V(L(R))  =  \a\V(R)  =  |det(E)|F(«) 

since  det(E)  =  a  in  this  case  (Exercise  7.3.7(a)). 

Case  3.  IfEis  obtained  by  adding  a  times  the  j  th  row  of  I  to  its  ( th  row  (j  ^  i),  then 


xr  = 


yr  ifr^z: 
y\  +  ayj    if  r  =  i . 


Then 

L(i?)  =  {(xi ,  x2,  ■  ■  ■ ,  xn)  I  at  +  axj  <  Xj  <  bj  +  axj  and  ar  <  xr  <  br  '\i r  ^  i}  , 
which  is  a  parallelogram  if  n  =  2  and  a  parallelepiped  if  n  =  3  (Figure  7.3.1).  Now 

V(L(R))  =  f  dX, 
JUR) 

which  we  can  evaluate  as  an  iterated  integral  in  which  the  first  integration  is  with  respect 
to  Xi .  For  example,  if  i  =  1 ,  then 

V(L(R))=  /     dxn  dxn-X---\     dx2  dxx.  (7.3.19) 

J  an  J  an—\  J  a.2  J  a\-\-axj 


Since 


(7.3.19)  can  be  rewritten  as 
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pbi+axj  pbi 

/  dyi=  dyi, 

Jai+ax/  Ja\ 


pbn  pb„-i  pb2  rbi 

V(L(R))  =  /     dx„  I        dxn-i  ■■■       dx2  /  d%\ 

Jan  J an—\  J  a2  Ja\ 

=  (b„  -a„)(b„-i  -a„-i)---(bi  -ai)  =  V(R). 
Hence,  V(L(R))  =  \  det(E)\V(R),  since  det(E)  =  1  in  this  case  (Exercise  7.3.7(a)). 


> 

...L 

 / 

'i  =  1,  /  =  2,  a : 

1 

1 

/ 

/ 

/ 

 >- 

y, 


i  =  2,  j  =  3,  a  >  0 


Figure  7.3.1 

From  what  we  have  shown  so  far,  (7.3.14)  holds  if  A  is  an  elementary  matrix  and  S  is 
any  compact  Jordan  measurable  set.  If  A  is  an  arbitrary  nonsingular  matrix, 
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then  we  can  write  A  as  a  product  of  elementary  matrices  (7.3.10)  and  apply  our  known 
result  successively  to  Li,  L2,  . . . ,      (see  (7.3.13)).  This  yields 


K(L(S))  =  |det(Ejfe)||det(Ejfc_0 
by  Theorem  6.1.9  and  induction. 


|detEi|K(S)  =  |det(A)|F(5), 


TJ 


Formulation  of  the  Rule  for  Change  of  Variables 


We  now  formulate  the  rule  for  change  of  variables  in  a  multiple  integral.  Since  we  are  for 
the  present  interested  only  in  "discovering"  the  rule,  we  will  make  any  assumptions  that 
ease  this  task,  deferring  questions  of  rigor  until  the  proof. 

Throughout  the  rest  of  this  section  it  will  be  convenient  to  think  of  the  range  and  domain 
of  a  transformation  G  :  W  — >•  R"  as  subsets  of  distinct  copies  of  R" .  We  will  denote  the 
copy  containing  Dq  as  E",  and  write  G  :  E"  — >•  R"  and  X  =  G(Y),  reversing  the  usual 
roles  of  X  and  Y. 

If  G  is  regular  on  a  subset  S  of  E",  then  each  X  in  G(S)  can  be  identified  by  specifying 
the  unique  point  Y  in  5  such  that  X  =  G(Y). 

Suppose  that  we  wish  to  evaluate  fT  f  (X)  dX,  where  T  is  the  image  of  a  compact  Jordan 
measurable  set  5  under  the  regular  transformation  X  =  G(Y).  For  simplicity,  we  take  S  to 
be  a  rectangle  and  assume  that  /  is  continuous  on  T  =  G(S). 

Now  suppose  that  P  =  {Ri,  R2, . . . ,  Rk}  is  a  partition  of  5  and  Tj  =  G(Rj)  (Fig- 
ure 7.3.2). 

v  y 

A 


R. 

j 

X  =  G(U) 


Figure  7.3.2 


Then 


k 

f  f(X)dX=J2  f  /(X)« 

•*T  /=i  JTj 


\dX  (7.3.20) 
(Corollary  7.1.31  and  induction).  Since  /  is  continuous,  there  is  a  point  X ;  in  Tj  such  that 


[   f(X)  dX  =  f(Xj)  [    dX=  f(Xj)V(Tj 
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(Theorem  7.1.28),  so  (7.3.20)  can  be  rewritten  as 

k 

f(X)dX  =       f(Xj)V(Tj).  (7.3.21) 

7=1 

Now  we  approximate  V(Tj).  If 

Xj  =  G(Yy),  (7.3.22) 

then  Yy  e  Rj  and,  since  G  is  differentiable  at  Yy , 

G(Y)  «  G(Yy)  +  G'(Yy)(Y  -  Yy).  (7.3.23) 

Here  G  and  Y  —  Yy  are  written  as  column  matrices,  G'  is  a  differential  matrix,  and 
means  "approximately  equal"  in  a  sense  that  we  could  make  precise  if  we  wished  (Theo- 
rem 6.2.2). 

It  is  reasonable  to  expect  that  the  Jordan  content  of  G(Rj )  is  approximately  equal  to  the 
Jordan  content  of  A(i?y),  where  A  is  the  affine  transformation 

A(Y)  =  G(Yy)  +  G(Yy)(Y-Yy) 

on  the  right  side  of  (7.3.23);  that  is, 

V(G(Rj))^V(A(Rj)).  (7.3.24) 

We  can  think  of  the  affine  transformation  A  as  a  composition  A  =  A3  o  A2  o  Ai,  where 

AX(Y)  =  Y-Yy, 
A2(Y)  =  G'(Yy)Y, 

and 

A3(Y)=G(Yy)+Y. 

Let  Rj  =  Ai(i?y).  Since  Ai  merely  shifts  Rj  to  a  different  location,  R'-  is  also  arectangle, 
and 

V(R'j)  =  V(Rj).  (7.3.25) 

Now  let  R"-  =  A.2(R'j).  (In  general,  Rr/-  is  not  a  rectangle.)  Since  A2  is  the  linear  transfor- 
mation with  nonsingular  matrix  G'(Yy),  Theorem  7.3.7  implies  that 

V(R'j))  =  I  detG'(Yy)|F(l?y)  =  \JG(Y j)\V(Rj),  (7.3.26) 

where  JG  is  the  Jacobian  of  G.  Now  let  R ".'  =  A3(R"-).  Since  A3  merely  shifts  all  points 
in  the  same  way, 

V(R'j')  =  V(Rj).  (7.3.27) 
Now  (7.3.24)-(7.3.27)  suggest  that 

V(Tj)  «  \JG(Yj)\V(Rj). 
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(Recall  that  7)  =  G(Rj).)  Substituting  this  and  (7.3.22)  into  (7.3.21)  yields 


f  f<todX*iJ2f(G(Yj))\JG(Yj)\V(Rj) 


But  the  sum  on  the  right  is  a  Riemann  sum  for  the  integral 


f  /(G(Y))|/G(Y)|rfY 


Js 


which  suggests  that 


f  f(X)dX  =  [  /(G(Y))|/G(Y)|rfY. 


Jt  Js 


We  will  prove  this  by  an  argument  that  was  published  in  the  American  Mathematical 
Monthly  [Vol.  61  (1954),  pp.  81-85]  by  J.  Schwartz. 

The  Main  Theorem 

We  now  prove  the  following  form  of  the  rule  for  change  of  variable  in  a  multiple  integral. 

Theorem  7.3.8  Suppose  that  G  :  E"  — >  R"  is  regular  on  a  compact  Jordan  measur- 
able set  S  and  f  is  continuous  on  G(S).  Then 


Since  the  proof  is  complicated,  we  break  it  down  to  a  series  of  lemmas.  We  first  observe 
that  both  integrals  in  (7.3.28)  exist,  by  Corollary  7.3.2,  since  their  integrands  are  continu- 
ous. (Note  that  S  is  compact  and  Jordan  measurable  by  assumption,  and  G(S)  is  compact 
and  Jordan  measurable  by  Theorem  7.3.5.)  Also,  the  result  is  trivial  if  7(S)  =  0,  since  then 
V(G(S))  =  0  by  Lemma  7.3.4,  and  both  integrals  in  (7.3.28)  vanish.  Hence,  we  assume 
that  V(S)  >  0.  We  need  the  following  definition. 

Definition  7.3.9  If  A  =  [a,j]  is  an  n  x  n  matrix,  then 


is  the  infinity  norm  of  A,  denoted  by  ||^4||oo- 

Lemma  7.3.10  Suppose  thatG  :  E"  — >•  R"  is  regular  on  a  cube  C  in  E" ,  and  let  A  be 
a  nonsingular  n  x  n  matrix.  Then 


(7.3.28) 


V(G(C))  <  |  det(A)|  [max  {HA^G'roiloo  |  Y  e 


C}]"  7(C). 


(7.3.29) 
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Proof  Let  s  be  the  edge  length  of  C .  Let  Yo  =  (c\,C2,  ■  ■  ■ ,  cn)  be  the  center  of  C,  and 
suppose  that  H  =  (y\,  y%, . . . ,  yn)  e  C .  If  H  =  (hi,  h-i, . . . ,  hn)  is  continuously  differen- 
tiable  on  C,  then  applying  the  mean  value  theorem  (Theorem  5.4.5)  to  the  components  of 
H  yields 

"  dh(Y) 

h (Y)  -  h (Yo)  =    -^r^iyj  -  CA  i  <  i  < », 
,=1  °yj 

where  Y,-  e  C .  Hence,  recalling  that 


H'(Y)  = 


dhi_ 
dyj 


applying  Definition  7.3.9,  and  noting  that  \y j  —  Cj  \  <  s/2, 1  <  j  <  n,  we  infer  that 

\hi(Y)- hi  (Y0)\  <  ^max{||H'(Y)||0O|YeC},     \<i  <n. 

This  means  that  H(C)  is  contained  in  a  cube  with  center  Xo  =  H(Yo)  and  edge  length 

smaxjHH'OOHoo  |Ye  C}. 

Therefore, 


(7.3.30) 


K(H(C))  <  [max{||H'(Y)||0O]n  |Ye  C)  s" 

=  [max{||H'(Y)||0O]n  |Ye  C}V(C). 
Now  let 

L(X)  =  A~XX 

and  set  H  =  L  o  G;  then 

H(C)  =  L(G(C))    and    H'  =  A_1G', 

so  (7.3.30)  implies  that 

V(L(G(C)))  <  [max  {HA^G'WIloo  |  Y  e  C}f  K(C).  (7.3.31) 
Since  L  is  linear,  Theorem  7.3.7  with  A  replaced  by  A-1  implies  that 

V(L(G(C)))  =  |det(A)-x|nG(C)). 
This  and  (7.3.31)  imply  that 

|det(A~x)|nG(C))  <  [max{||A-1G'(Y)||0O|YeC}]n  V(C). 
Since  det(A_1)  =  1/  det(A),  this  implies  (7.3.29).  H 

Lemma  7.3.11  If  G  :  E"  -s-  W  is  regular  on  a  cube  C  in  W,  then 

V(G(C))<  J  \  JG(Y)\dY.  (7.3.32) 
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Proof  Let  P  be  a  partition  of  C  into  subcubes  C\,  Ci,  ■  ■  ■ ,  Cfc  with  centers  Yi ,  Y2, . . . , 
Y*.  Then 

k 

V(G(C))  =       V(G(Cj))-  (7.3.33) 

.7=1 

Applying  Lemma  7.3.10  to  C,  with  A  =  G'(Ay)  yields 

V(G(Cj))  <  \JG(Yj)\  [max  (IKG'CYy^G'WHoo  |  Y  e  Cj}]"  V(Cj).  (7.3.34) 
Exercise  6.1.22  implies  that  if  e  >  0,  there  is  a  5  >  0  such  that 

max{||(G'(Y7-)r1G'(Y)||00|YeC7-}  <  1  +  e,     1  <  j  <  k,    if  \\P\\<&. 
Therefore,  from  (7.3.34), 

V(G(Cj))<(l  +  €)n\JG(Yj)\V(Cj), 

so  (7.3.33)  implies  that 

k 

V{G{C))  <  (1  +  e)"       \JG(Y j)\V(Cj)    if    ||P||  <  8. 
y'=i 

Since  the  sum  on  the  right  is  a  Riemann  sum  for  fc  |  /G(Y)|  dY  and  e  can  be  taken  arbi- 
trarily small,  this  implies  (7.3.32).  TJ 

Lemma  7.3.12  Suppose  that  S  is  Jordan  measurable  and  e,  p  >  0.  Then  there  are 
cubes  Ci,  C2,  Cr  in  S  with  edge  lengths  <  p,  such  that  Cj  C  S,  1  <  j  <  r, 
C°  n  C°  =  0  rf  i  ^  j,  and 

r 

V(S)<J2v(cj)+€-  (7.3.35) 

.7=1 

Proof    Since  S  is  Jordan  measurable, 

j  fs(K)dX  =  V(S) 

if  C  is  any  cube  containing  5.  From  this  and  the  definition  of  the  integral,  there  is  a  S  >  0 
such  that  if  P  is  any  partition  of  C  with  ||P  ||  <  8  and  a  is  any  Riemann  sum  of  ijrs  over 
P,  then  a  >  F(S)  —  e/2.  Therefore,  if  s(.P)  is  the  lower  sum  of  xffs  over  P,  then 

s(P)  >  V(S)  -  e    if    ||P||  <  8.  (7.3.36) 

Now  suppose  that  P  =  {C\,  C2,  •  •  ■ ,  C/t}  is  a  partition  of  C  into  cubes  with  ||.P||  < 
min(p,  8),  and  let  C\,  C2,  Ck  be  numbered  so  that  Cj  C  S  if  1  <  j  <  r  and 
Cy  n  5e  7^  0  if  7  >  r.  From  (7.3.4),  j(P)  =  This  and  (7.3.36)  imply 

(7.3.35).  Clearly,  C,°  n  C?  =  0  if  i  /  y .  H 
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Lemma  7.3.13  Suppose  thatG  :  E"  — >  W  is  regular  on  a  compact  Jordan  measur- 
able set  S  and  f  is  continuous  and  nonnegative  on  G(S).  Let 

Q{S)=  [     f(X)dX-  [  /(G(Y))|/G(Y)|dY.  (7.3.37) 
Jg(s)  Js 

Then  Q(S)  <  0. 

Proof  From  the  continuity  of  JG  and  /  on  the  compact  sets  S  and  G(S),  there  are 
constants  M\  and  M-i  such  that 

|/G(Y)|  <  Mi    if   Y  €  S  (7.3.38) 

and 

\f(X)\  <  M2    if   X  €  G(S)  (7.3.39) 

(Theorem  5.2.11).  Now  suppose  that  e  >  0.  Since  /  o  G  is  uniformly  continuous  on  S 
(Theorem  5.2.14),  there  is  a  S  >  0  such  that 

|/(G(Y))  -  /(G(Y'))|  <  e    if    |Y-  Y'|  <  J  and  Y,  Y'  e  S.  (7.3.40) 

Now  let  C\,  C2,  ■■  ■ ,  Cr  be  chosen  as  described  in  Lemma  7.3.12,  with  p  =  8/ *Jn.  Let 


Si  =  <J  Y  e  S  I  Y  i  [J  C, 


Then  <  e  and 


S=(|JCyJu5i.  (7.3.41) 

Suppose  that  Yi,  Y2, Yr  are  points  in  C\,  C2,  ■  ■  ■ ,  Cr  and  Xy  =  G(Yy),  1  <  j  <  r. 
From  (7.3.41)  and  Theorem  7.1.30, 


Q(S)=  [      f(X)dX-  f  /(G(Y))|/G(Y)|rfY 
Jg(Si)  Jsi 

+  T[       f(X)dX-Y  f  f(G(Y))\JG(Y)\dY 

p[JG(Cj)  j=lJCj 

=  [      f(X)dX-  [  /(G(Y))|/G(Y)|rfY 
+  E/  <J<R-f(Aj))dX 

JTi  •'G(Cy) 

+  £  f  ((/(G(Yy))-/(G(Y)))|/(G(Y)|dY 
+  J2f(Xj)(v(G(Cj))-  jf  |/G(Y)|^. 
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Since  /(X)  >  0, 

/(G(Y))|/G(Y)|rfY  >0, 


Is, 


IS! 

and  Lemma  7.3.1 1  implies  that  the  last  sum  is  nonpositive.  Therefore, 


Q(S)  <h+h  +  h,  (7.3.42) 


where 


and 


h  =  f    f(x)  dx,  i2  =  T  I     |/(X)  -  f(Xj)\ dx, 

h  =  £  /    l/(GXYv))  -  /(G(Y))||/G(Y)|  dY. 
j=iJcJ 

We  will  now  estimate  these  three  terms.  Suppose  that  e  >  0. 

To  estimate  I\,  we  first  remind  you  that  since  G  is  regular  on  the  compact  set  5,  G  is 
also  regular  on  some  open  set  0  containing  S  (Definition  6.3.2).  Therefore,  since  Si  C  S 
and  V(Si)  <  €,  Si  can  be  covered  by  cubes  T\,  2*2, Tm  such  that 

r 

Y,V(Tj)<e  (7.3.43) 

./  =  ! 

and  G  is  regular  on  UT=i  -0  •  Now, 

h  <  M2V(G(S1))  (from  (7.3.39)) 

m 

<M2J2  V(G(Tj ))  (since  Si  C  UJ=1  Tj) 

.7=1 


<M2)    /    \JG(Y)\dY    (from  Lemma  7.3.11) 
j=i  JtJ 

<  M2M1e  (from  (7.3.38)  and  (7.3.43)). 


To  estimate  I2,  we  note  that  if  X  and  X,  are  in  G(Cj )  then  X  =  G(Y)  and  X,  =  G(Y, ) 
for  some  Y  and  Y ;  in  Cj .  Since  the  edge  length  of  C,  is  less  than  8/ *Jn,  it  follows  that 
|Y  -  Yj  |  <  S,  so  |/(X)  -  f(X/)\  <  e,  by  (7.3.40).  Therefore, 

r 

i2<€j2nG(Cj)) 


\JG(Y)\dY    (from Lemma  7.3.11) 


J=iJCJ 


<  eM1       v(Cj)  (from  (7.3.38)) 

<  eMiViS)  (since  Urj=1  Cj  C  S). 
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To  estimate  h,  we  note  again  from  (7.3.40)  that  |/(G(Yy))  -  /(G(Y))|  <  e  if  Y  and 
Yj  are  in  Cj .  Hence, 

h<eJ2f  \JG(X)\dY 

r 

<  Mie      v(cj)    (from  (7-3.38) 

<  Mx  V(S)e 

because  [fJ=l  Cj  C  S  and  C°  n  C°  =  0  if  i  ^  y. 

From  these  inequalities  on  I\,  I2,  and  ^3,  (7.3.42)  now  implies  that 

Q(S)  <M1(M2+2V(S))€. 

Since  e  is  an  arbitrary  positive  number,  it  now  follows  that  Q(S)  <  0.  TJ 

Lemma  7.3.14  Under  the  assumptions  of  Lemma  7.3.13,  Q(S)  >  0. 

Proof  Let 

Gi  =  G-\    Si  =  G(S),     /1  =  (|/G|)/  o  G,  (7.3.44) 

and 

Qi(Si)=  [        MY)dY-  [  MGiOQyjGiQQldX.  (7.3.45) 

Since  Gi  is  regular  on  Si  (Theorem  6.3.3)  and  f\  is  continuous  and  nonnegative  on 
Gi(Si)  =  S,  Lemma  7.3.13  implies  that  ^i('S'i)  —  0-  However,  substituting  from  (7.3.44) 
into  (7.3.45)  and  again  noting  that  Gi  (Si)  =  S  yields 

Qi(Si)=  f  /(G(Y))|/G(Y)|  dY 

Js  r  (7.3.46) 
-  /  /(G(G-1(X)))|/G(G-1(X))||/G-1(X)|rfX. 

Since  G(G_1  (X))  =  X,  / (G(G~X  (X)))  =  /(X).  However,  it  is  important  to  interpret  the 
symbol  /G(G_1  (X))  properly.  We  are  not  substituting  G_1  (X)  into  G  here;  rather,  we  are 
evaluating  the  determinant  of  the  differential  matrix  of  G  at  the  point  Y  =  G_1  (X).  From 
Theorems  6.1.9  and  6.3.3, 

|/G(G-1(X))||/G~1(X)|  =  1, 
so  (7.3.46)  can  be  rewritten  as 

0iGSi)=  f  /(G(Y))|/G(¥)|rfY-  f     f(X)dX  =  -Q(S). 
Js  Jg(s) 

Since  gi(Si)  <  0,  it  now  follows  that  Q(S)  >  0.  TJ 
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We  can  now  complete  the  proof  of  Theorem  7.3.8.  Lemmas  7.3.13  and  7.3.14  imply 
(7.3.28)  if  /  is  nonnegative  on  5.  Now  suppose  that 

m  =  min  {/(X)  |  X  e  G(S)}  <  0. 

Then  /  —  m  is  nonnegative  on  G(S),  so  (7.3.28)  with  /  replaced  by  f  —  in  implies  that 

f     (f(X)-m)dX=  f  (f(G(Y)-m)\JG(Y)\dY.  (7.3.47) 

JG(S)  JS 

However,  setting  /  =  1  in  (7.3.28)  yields 

f      dX=  f  \JG(Y)\dY, 

so  (7.3.47)  implies  (7.3.28).  TJ 

The  assumptions  of  Theorem  7.3.8  are  too  stringent  for  many  applications.  For  example, 
to  find  the  area  of  the  disc 

{(x,y)\x2  +  y2  <  1}, 
it  is  convenient  to  use  polar  coordinates  and  regard  the  circle  as  G(S),  where 


G(r,  9)  = 


r  cos  9 
r  sin  (9 


and  5  is  the  compact  set 


S  =  {(r,  9)  |0  <  r  <  1,  0<  6  <  2n} 


(7.3.48) 


(7.3.49) 


(Figure  7.3.3). 


G(S) 

s\  x1  +y2  = 

 — 

Figure  7.3.3 
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Since 

r  cos  8    —r  sin  ( 


G'(r,  9) 


sin  0      r  cos  ( 


it  follows  that  JG(r,  6)  =  r .  Therefore,  formally  applying  Theorem  7.3.8  with  /  =  1 
yields 


f  dX=  f  rd(r,6)=  f  r  dr  f  d6 
Jg(S)  Js  Jo  Jo 


Although  this  is  a  familiar  result,  Theorem  7.3.8  does  not  really  apply  here,  since  G(r,  0)  = 
G(r,  2tt),  0  <  r  <  1,  so  G  is  not  one-to-one  on  5,  and  therefore  not  regular  on  S . 

The  next  theorem  shows  that  the  assumptions  of  Theorem  7.3.8  can  be  relaxed  so  as  to 
include  this  example. 

Theorem  7.3.15  Suppose  that  G  :  E"  — >•  R"  is  continuously  differentiable  on  a 
bounded  open  set  N  containing  the  compact  Jordan  measurable  set  S,  and  regular  on 
5°.  Suppose  also  that  G(S)  is  Jordan  measurable,  f  is  continuous  on  G(5),  and  G(C)  is 
Jordan  measurable  for  every  cube  C  C  N.  Then 

[  f(X)dX=  f  f(G(Y))\JG(Y)\dY.  (7.3.50) 
Jg(s)  Js 

Proof  Since  /  is  continuous  on  G(5)  and(|/G|)/ oG  is  continuous  on  5,  the  integrals 
in  (7.3.50)  both  exist,  by  Corollary  7.3.2.  Now  let 

p  =  dist  (9S,  Nc) 

(Exercise  5.1.25),  and 

P  =  {Y|  dist(Y,  dS)}  <  -. 

Then  P  is  a  compact  subset  of  N  (Exercise  5.1.26)  and  dS  C  P°  (Figure  7.3.4). 

Since  S  is  Jordan  measurable,  V(dS)  =  0,  by  Theorem  7.3.1.  Therefore,  if  e  >  0,  we 
can  choose  cubes  C\,  C2,  ■  ■  ■ ,  Ck  in  P°  such  that 

k 

dS  C  (J  C°  (7.3.51) 

7  =  1 

and 

k 

nCj)  <  €  (7.3.52) 

7  =  1 

Now  let  Si  be  the  closure  of  the  set  of  points  in  S  that  are  not  in  any  of  the  cubes  C\ , 
C2,  . . . ,  Ck',  thus, 


Si  =  sn{vjkj=lCj)c 
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Because  of  (7.3.51),  Si  fl  9S  =  0,  so  Si  is  a  compact  Jordan  measurable  subset  of  S°. 
Therefore,  G  is  regular  on  Si,  and  /  is  continuous  on  G(Si).  Consequently,  if  Q  is  as 
defined  in  (7.3.37),  then  Q(Si)  =  0  by  Theorem  7.3.8. 


N  =  open  set  bounded  by  outer  curve 
S  =  closed  set  bounded  by  inner  curve 


Figure  7.3.4 


Now 

(Exercise  7.3.1 1)  and 

lfiOSnsf)|  < 

But 


Q(S)  =  Q(Si)  +  Q(S  n  so  =  Q(s  n  sxc) 


[  f(X)dX  +  [  /(G(Y))|/G(Y)|JY 
JG(SnSf)  Jsnsf 


(7.3.53) 


f  /(G(Y))|/G(Y)|</Y 


MiM2F(S  n  Sxc),  (7.3.54) 


where  Mx  and  M2  are  as  defined  in  (7.3.38)  and  (7.3.39).  Since  S  D  Sf  C  uj:=1C/, 
(7.3.52)  implies  that  K(S  n  Sf )  <  e;  therefore, 


/ 

Jsns$ 


/(G(Y))|/G(Y)|JY 


<  MiM2e, 


(7.3.55) 


from  (7.3.54).  Also 


/  /(X) 
^G(snSf) 


M2V(G(S  n  Sxe))  <  M2  J2  nG{Cj)).  (7.3.56) 

.7  =  1 
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By  the  argument  that  led  to  (7.3.30)  with  H  =  G  and  C  =  Cj, 

V(G(Cj))  <  [mtt{||G'COII«  |  Y  e  Cj}]"  V(Cj), 
so  (7.3.56)  can  be  rewritten  as 


L 


G(snsf) 


f(X)dX 


<M2[max{||G'(Y)||00|Ye  P}f  e. 


because  of  (7.3.52).  Since  e  can  be  made  arbitrarily  small,  this  and  (7.3.55)  imply  that 
Q(S  n  Sf )  =  0.  Now  Q(S)  =  0,  from  (7.3.53).  H 

The  transformation  to  polar  coordinates  to  compute  the  area  of  the  disc  is  now  justi- 
fied, since  G  and  S  as  defined  by  (7.3.48)  and  (7.3.49)  satisfy  the  assumptions  of  Theo- 
rem 7.3.15. 

Polar  Coordinates 


x 

y 


r  cos  9 
r  sin  6 


(7.3.57) 


If  G  is  the  transformation  from  polar  to  rectangle  coordinates 

G(r,  6)  = 

then  /G(r,  9)  =  r  and  (7.3.50)  becomes 

/      f(x,  y)d(x,  y)  =  I  f(r  cos  9,  r  sin#)r  d(r,  9) 
Jg(S)  Js 

if  we  assume,  as  is  conventional,  that  S  is  in  the  closed  right  half  of  the  r  (9 -plane.  This 
transformation  is  especially  useful  when  the  boundaries  of  S  can  be  expressed  conveniently 
in  terms  of  polar  coordinates,  as  in  the  example  preceding  Theorem  7.3.15.  Two  more 
examples  follow. 


Example  7.3.2  Evaluate 


where  T  is  the  annulus 


7  =  JT(x2  +  y)d(x,y), 
T  =  {(x,y)  \  1  <x2  +  y2  <  4} 


(Figure  7.3.5(b)). 

Solution  We  write  T  =  G(S),  with  G  as  in  (7.3.57)  and 

S  =  {(r,9)  \  l  <r  <2,  0  <  9  <2it} 
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(Figure  7.3.5(a)).  Theorem  7.3.15  implies  that 

/  =  /  (r2  cos2  6  +  r  sin  6)r  d(r,  6), 
Js 

which  we  evaluate  as  an  iterated  integral: 


r2dr  J     (r  cos2  9  +  sin 0)^0 


(a)  (b) 

Figure  7.3.5 

Example  7.3.3  Evaluate 

where  T  is  the  region  in  the  xj-plane  bounded  by  the  curve  whose  points  have  polar  coor- 
dinates satisfying 

r  =  1  -  cos#,     0  <  6  <  7t 

(Figure  7.3.6(b)). 

Solution  We  write  T  =  G(S),  with  G  as  in  (7.3.57)  and  S  the  shaded  region  in 
Figure  7.3.6(a).  From  (7.3.50), 

/  =  /  (rsin9)rd(r,6), 
Js 
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which  we  evaluate  as  an  iterated  integral: 

•1-COS0 


I  =  (   sin  9  d9  f 
Jo  Jo 


=  _(i  -cos(9)4 
12  ' 


1  fn 

r2dr  =  -  /    (1  —  cos  6»)3  sin  6>  <i6» 
3  Jo 


(a)  (b) 

Figure  7.3.6 


Spherical  Coordinates 

If  G  is  the  transformation  from  spherical  to  rectangular  coordinates, 


x 

y 

z. 


=  G(r,  6, 4>) 


r  cos  9  cos  <p 
r  sin  9  cos  <p 
r  sin^ 


then 


G'(r,  9,  0)  = 
and  JG(r,  9,  (p)  =  r2  cos  (p,  so  (7.3.50)  becomes 


cos(9cos</>    —r  sin  9  cos  (p    —r  cos9  sincp 
sin  9  cos  (p      r  cos  (9  cos  0    —r  sin  9  sincp 
sincp  0        r  cos  cp 


\  f(x,y,z)d(x,y,z) 
Jg(S) 

=  I  f(r  cos  9  cos  cp,  r  sin  9  cos  cp,  r  sin  cp)r2  cos  cp  d(r,  9,  cp) 
Js 

if  we  make  the  conventional  assumption  that  \cp\  <  n/2  and  r  >  0. 


(7.3.58) 


(7.3.59) 
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Example  7.3.4  Let  a  >  0.  Find  the  volume  of 

T  =  {(x,  y,  z)  |  x2  +  y2  +  z2  <  a2,  x  >  0,  y  >  0,  z  >  0} 
which  is  one  eighth  of  a  sphere  (Figure  7.3.7(b)). 


Solution  We  write  T  =  G(5)  with  G  as  in  (7.3.58)  and 

S  =  {(r,  9, 0)  |  0  <  r  <  a,  0<6<  n/2,  0  <  4>  <  ji/2} 
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(Figure  7.3.7(a)),  and  let  /  =  1  in  (7.3.59).  Theorem  7.3.15  implies  that 

V(T)=  I      dX=  I  r2  cos  <j>  d(r,  9,  <j>) 
JG(S)  Js 

=LrdrL  deL  cos^=(t)(2) 


Example  7.3.5  Evaluate  the  iterated  integral 


V  a2— x2         f s/ a2~x2—y2 


i  = 


I    x  dx  I  dy  I 

Jo         Jo  Jo 


zdz    (a  >  0). 


Solution  We  first  rewrite  /  as  a  multiple  integral 


x,y,z) 


where  G  and  S  are  as  in  Example  7.3.4.  From  Theorem  7.3.15, 


=  /      xz.  d(. 
Jg(S) 


pa  nit/2  p7T/z  fa5\  /1\  a5 

=  J    r4drj      cos9d9J      cos2  </>  sin0  d<j>  =  (7.3.1)  l~ J  =  — 


1=1  (r  cos  9  cos  (p)(r  sin (p)(r  cos  (p)  d(r,  9,  (p) 
Is 

-jt/2  rn/2 


Other  Examples 

We  now  consider  other  applications  of  Theorem  7.3.15. 
Example  7.3.6  Evaluate 

/  =  j  (x  +4y)d(x,y), 

where  T  is  the  parallelogram  bounded  by  the  lines 

x  +  y  =  1,    x  +  y  =  2,    x  —  2y  =  0,    and    x  —  2y  =  3 
(Figure  7.3.8(b)). 

Solution  We  define  new  variables  u  and  v  by 

=  F(x,y)  = 


x  +  y 
x  —  2y 
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=  F-1(M,v) 


1  2 


(a) 


x  -  2y  =  0 


Then 


x 

y 


F-1(w,  v) 


2u  +  v 
3 

u  —  V 


JF-\u,v) 


2  1 

3  3 

1  _1 

3  3 


andr  =  F_1(5),  where 


S  =  {(u,v)  |1<m<2,  0<i><3} 
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(Figure  7.3.8(a)).  Applying  Theorem  7.3.15  with  G  =  F"1  yields 

f  (2u  +  v      Au-4v\  fl\   ,  If 
I  =      y—^-  +  3  J  (  3  J  d(u,  v)  =  -  jf  (2m  -  v)  d(u,  v) 


3 

-3  /-2 


—  —  f  dv  f  (2u  —  v)  du  =  —  f  (u2  —  Mii)  dv 
3  Jo       Ji  3  Jo  L=i 


Example  7.3.7  Evaluate 

--L 


(x2-y2)2  0Ax2y2  (v2 


(x2  +  y2)d(x,y), 


where  T  is  the  annulus  T  =  {(x,  y)  |  a2  <  x2  +  y2  <  b2}  with  a  >  0  and  b  >  0  (Fig 
ure  7.3.9(a)). 


(a) 


(b) 


Figure  7.3.9 


Solution  The  forms  of  the  arguments  of  the  exponential  functions  suggest  that  we 
introduce  new  variables  u  and  v  defined  by 


*  -  y 

2xy 


and  apply  Theorem  7.3. 1 5  to  G  =  F_1 .  However,  F  is  not  one-to-one  on  7"°  and  therefore 
has  no  inverse  on  7"°  (Example  6.3.4).  To  remove  this  difficulty,  we  regard  T  as  the  union 
of  the  quarter-annuli  T\,  T%,  Tj,  and  T4  in  the  four  quadrants  (Figure  7.3.9)(b)),  and  let 


,  =  I  eix2-y2)2e4x2y2(x2  +  y2)  d(x,  y). 
•'T.i 
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Since  the  pairwise  intersections  of  T\,  T%,  7~3,  and  T4  all  have  zero  content,  I  =  I\  + 
I7.  +  h  +  I  a  (Corollary  7.1.31).  Theorem  7.3.8  implies  that  I\  =  1%  =  I3  =  I4  (Exer- 
cise 7.3.12),  so  /  =  4/i .  Since  I\  does  not  contain  any  pairs  of  distinct  points  of  the  form 
(xq,  yo)  and  (— xo,  —yo),  F  is  one-to-one  on  T\  (Example  6.3.4), 


(Figure  7.3.10(b)), 


u2  +  v2  <  b4, 

v  >  0} 

V 

I 

Cr 

1  >v 

(a) 


(b) 


Figure  7.3.10 


and  abranch  G  of  F  1  can  be  defined  on  Si  (Example  6.3.8).  Now  Theorem  7.3.15  implies 
that 

h  =  [  e{x2-y2)2e4x2yL(x2  +  y2)\JG(u,v)\d(u,v), 
JSi 


where  x  and  y  must  still  be  written  in  terms  of  u  and  v.  Since  it  is  easy  to  verify  that 

JF(x,y)  =  4(x2  +  y2) 

and  therefore 


JG(u,v)  = 


1 


doing  this  yields 


4(x2  +  y2)' 

h  =  \  I  e"2+v2d(u,v). 
4  Js, 


(7.3.60) 


To  evaluate  this  integral,  we  let  p  and  a  be  polar  coordinates  in  the  i/u-plane  (Figure  7.3.1 1) 
and  define  H  by 


=  H(p,a)  = 


p  cos  a 
psina 


then  Si  =  H(Si),  where 


Si  =  {(p,  a)  I  a2  <  p  <  b2,  0  <  a  <  n} 
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(Figure  7.3.10(a));  hence,  applying  Theorem  7.3.15  to  (7.3.60)  yields 

h  =  \  L  ep2\JK(p,a)\d(p,a)=]-  [  pe"2  d(p,a) 

4  JS i  4  JS, 


hence, 


=  i-rda  f 

4  Jo  Ja^ 


pe^  dp  =  ; 


I=Ah  =  n-{ebA-e°\ 


Figure  7.3.11 

Example  7.3.8  Evaluate 

j-  I  exl+X2+-+x"d(x1,x2,...,x„), 

where  T  is  the  region  defined  by 

ai  <  x\  +  X2  +  h  Xj  <  bi,     1  <  i  <  n. 


Solution  We  define  the  new  variables  y\,  y2,  . . . ,  yn  by  Y  =  F(X),  where 

fi(X)  =  xi  +  x2H  Hi,  l<i<n. 

If  G  =  F"1  then  7"  =  G(5),  where 

S  =  [ai.ii]  x  [a2,  b2]  x  •■•  x  [a„,Z>„], 
and  /G(Y)  =  1,  since  /F(X)  =  1  (verify);  hence,  Theorem  7.3.8  implies  that 
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1=1  ey"d(yi,y2,...,yn) 


Js 


I     dyi        dy2---\       dyn-\  I  ey"dy„ 


=  (bi  -  ai)(b2  -  a2)  •  •  ■  (pn-i  -  an-1)(eb"  -  ea"). 


7.3  Exercises 


1.     Give  a  counterexample  to  the  following  statement:  If  Si  and  S2  are  disjoint  subsets 
of  a  rectangle  R,  then  either 


2.  Show  that  a  set  E  has  content  zero  according  to  Definition  7.1.14  if  and  only  if  E 
has  Jordan  content  zero. 

3.  Show  that  if  Si  and  S2  are  Jordan  measurable,  then  so  are  Si  U  S2  and  Si  fl  S2. 

4.  Prove: 

(a)  If  S  is  Jordan  measurable  then  so  is  S,  and  V(S  )  =  V(S).  Must  S  be  Jordan 
measurable  if  S  is? 

(b)  If  T  is  a  Jordan  measurable  subset  of  a  Jordan  measurable  set  S,  then  S  —  T 


5.  Suppose  that  H  is  a  subset  of  a  compact  Jordan  measurable  set  S  such  that  the  inter- 
section of  H  with  any  compact  subset  of  S°  has  zero  content.  Show  that  V(H)  =  0. 

6.  Suppose  that  E  is  an  n  x  n  elementary  matrix  and  A  is  an  arbitrary  n  x  p  matrix. 
Show  that  EA  is  the  matrix  obtained  by  applying  to  A  the  operation  by  which  E  is 
obtained  from  the  n  x  n  identity  matrix. 

7.  (a)    Calculate  the  determinants  of  elementary  matrices  of  types  (a),  (b),  and  (c) 

of  Lemma  7.3.6. 

(b)  Show  that  the  inverse  of  an  elementary  matrix  of  type  (a),  (b),  or  (c)  is  an 
elementary  matrix  of  the  same  type. 

(c)  Verify  the  inverses  given  for  Ei  E6  in  Example  7.3.1. 


or 


is  Jordan  measurable. 
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8.    Write  as  a  product  of  elementary  matrices. 


1 

0 

1  " 

"  2 

3 

-2 

(a) 

1 

1 

0 

(b) 

0 

-1 

5 

0 

1 

1 

0 

-2 

4 

9.     Suppose  that  ad— be  ^0,u\  <  u2,  and^i  <  v2.  Find  the  area  of  the  parallelogram 
bounded  by  the  lines 

ax  +  by  =  Mi,  ax  +  by  =  u%, 
cx  +  dy  =  v\,       cx  +  dy  =  v2. 

10.     Find  the  volume  of  the  parallelepiped  defined  by 

1  <  2x  +  3y  -2z  <  2,    5  <  -x  +  5y  <  7,     1  <  -2x  +  Ay  <  6. 

In  writing  Eqn.  (7.3.53)  we  assumed  that 


11 


12. 


/     f(X)dX=  f       f(X)dX+  f  f{X)dX. 
JG(S)  JG(Si)  JG(sns[) 

Justify  this.  Hint:  Show  that  G(Si )  n  G(S  n  S[ )  has  zero  content. 
Use  Theorem  7.3.8  to  show  that  I\  =  1%  =  I3  =  I4  in  Example  7.3.7. 
13.     Let  e,-  =  ±1,  0  <  i  <  n.  Let  T  be  a  bounded  subset  of  R"  and 

T  =  {(eix1,e2x2,...,e„x„)\  (xi,  x2, . . . ,  x„)  e  T) . 

Suppose  that  /  is  defined  on  T  and  define  g  on  T  by 

g(e1x1,e2x2, . . .  ,e„xn)  =  e0f(x1,x2,...,x„). 

(a)  Prove  directly  from  Definitions  7.1.2  and  7.1.17  that  /  is  integrable  on  T  if 
and  only  if  g  is  integrable  on  T,  and  in  this  case 

fg(Y)dY  =  e0  [  f(X)dX. 
Jt  Jt 

(b)  Suppose  that  f  =  T, 

f(eiXi,e2x2,...,e„x„)  =  -f(x1,x2,...,xn), 
and  /  is  integrable  on  T.  Show  that 

/(X)  dX  =  0. 


14.     Find  the  area  of 

(a)    {(x,  y)  I  y  <  x  <  Ay,  1  <  x  +  2y  <  3}; 
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(b)    {(x,y)\2  <  xy  <  4,  2x  <  y  <  5x}. 

15.  Evaluate 

J  (3x2  +  2y  +  z)  d(x,  y,z), 

where 

T  =  {(x,y,z)\\x-y\  <  1,  \y  -  z\  <  1,  \z  +  x\  <  1}  . 

16.  Evaluate 

J^(y2  +  x2y-2x4)d(x,y), 
where  T  is  the  region  bounded  by  the  curves 

xy  =  1,    xj  =  2,    j  =  x2,    y  =  x2  +  1. 

17.  Evaluate 


J  (x4-  y4)exy  d(x,y), 


where  T  is  the  region  in  the  first  quadrant  bounded  by  the  hyperbolas 
xy  =  1,    xj  =  2,    x2  —  y2  =  2,    x2  —  y2  =  3. 

18.     Find  the  volume  of  the  ellipsoid 

2  2  2 

X        V  z, 

— +  f-  +  — =  1  (fl.fc.oO). 


19.  Evaluate 


where 


d(x,  >>,  z), 


^x2  +  J2  +  z2 
r  =  {(x,  y,  z)  |  9  <  x2  +  y2  +  z2  <  25} . 


20.  Find  the  volume  of  the  set  T  bounded  by  the  surfaces  z  =  0,  z  =  ^/x2  +  y2,  and 
x2  +  y2  =  4. 

21.  Evaluate 

y  xyz(x4  —  y4)  d(x,  y,  z), 

where 


T  =  {(x,  j,  z)  |  1  <  x2  -  y2  <  2,  3  <  x2  +  y2  <  4,  0  <  z  <  l}  . 
22.  Evaluate 

rV2  f-y/4-y2  Jx  p2  /-V4-X2 


/l           /•  Vl-x2  /■  V  l-x2-j2 

rfx  /   J}>  /  z2fi?z 
-l      J-a/T^I2  Jo 
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23.     Use  the  change  of  variables 

=  G(r,  01,02,03) 


x\ 

X3 
X4 


r  cos  t>i  cos  t>2  cos  fc>3 
r  sin  0i  cos  02  cos  03 
r  sin  02  cos  03 
r  sin  03 


to  compute  the  content  of  the  4-ball 

T  =  {(xi ,  x2,  X3,  X4)  I  x2  +  x\  +  x\  +  x\  <  a2} . 

24.     Suppose  that  A  =         is  a  nonsingular  «  x  n  matrix  and  T  is  the  region  in  W 
defined  by 

ai  <  cii\X\  +  ai2x2  H  h  flin^  <  jS,- ,     1  <  i  <  n. 

(a)  FindF(r). 

(b)  Show  that  if  c\,  c2,       cn  are  constants,  then 


4lH 


i  =  l 


where 


d\ 

C\ 

d2 

C2 

_  d„ 

_  C" 

25.     If  V„  is  the  content  of  the  «-ball  T  =  {X  |  |X|  <  l},  find  the  content  of  the  n- 
dimensional  ellipsoid  defined  by 


n  2 


Leave  the  answer  in  terms  of  V„ . 


CHAPTER  8 
Metric  Spaces 


IN  THIS  CHAPTER  we  study  metric  spaces. 

SECTION  8.1  defines  the  concept  and  basic  properties  of  a  metric  space.  Several  examples 
of  metric  spaces  are  considered. 

SECTION  8.2  defines  and  discusses  compactness  in  a  metric  space. 
SECTION  8.3  deals  with  continuous  functions  on  metric  spaces. 

8.1  INTRODUCTION  TO  METRIC  SPACES 

Definition  8.1.1  A  metric  space  is  a  nonempty  set  A  together  with  a  real-valued  func- 
tion p  defined  on  Ax  A  such  that  if  it,  v,  and  w  are  arbitrary  members  of  A,  then 

(a)  p(u,  v)  >  0,  with  equality  if  and  only  if  u  =  v; 

(b)  p(u,  v)  =  p(v,  m); 

(c)  p(u,  v)  <  p(w,  w)  +  p(w,  v). 

We  say  that  p  is  a  metric  on  A.  ■ 

If  n  >  2  and  U\,  u-i,  un  are  arbitrary  members  of  A,  then  (c)  and  induction  yield 
the  inequality 


Example  8.1.1  The  set  R  of  real  numbers  with  p(u,  v)  =  \u  —  v\  is  a  metric  space. 
Definition  8.1.1  (c)  is  the  familiar  triangle  inequality: 


Motivated  by  this  example,  in  an  arbitrary  metric  space  we  call  p(u,  v)  the  distance  from 
u  to  v,  and  we  call  Definition  8.1.1  (c)  the  triangle  inequality. 


\u  —  v\  <  \u  —  w\  +  \  w  —  u\. 
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Example  8.1.2  If  A  is  an  arbitrary  nonempty  set,  then 


p(u,  v)  = 


! 


0  if  u  =  v, 

1  ifu^v 


is  a  metric  on  A  (Exercise  8.1.5).  We  call  it  the  discrete  metric. 


Example  8.1.2  shows  that  it  is  possible  to  define  a  metric  on  any  nonempty  set  A.  In 
fact,  it  is  possible  to  define  infinitely  many  metrics  on  any  set  with  more  than  one  member 
(Exercise  8.1.3).  Therefore,  to  specify  a  metric  space  completely,  we  must  specify  the 
couple  (A,  p),  where  A  is  the  set  and  p  is  the  metric.  (In  some  cases  we  will  not  be  so 
precise;  for  example,  we  will  always  refer  to  the  real  numbers  with  the  metric  p(u,  v)  = 
\u  —  v\  simply  as  K.) 

There  is  an  important  kind  of  metric  space  that  arises  when  a  definition  of  length  is 
imposed  on  a  vector  space.  Although  we  assume  that  you  are  familiar  with  the  definition 
of  a  vector  space,  we  restate  it  here  for  convenience.  We  confine  the  definition  to  vector 
spaces  over  the  real  numbers. 

Definition  8.1.2  A  vector  space  A  is  a  nonempty  set  of  elements  called  vectors  on 
which  two  operations,  vector  addition  and  scalar  multiplication  (multiplication  by  real 
numbers)  are  defined,  such  that  the  following  assertions  are  true  for  all  U,  V,  and  W  in 
A  and  all  real  numbers  r  and  s : 

1.  U  +  Ve  A; 

2.  U  +  V=  V  +  U; 

3.  U  +  (V  +  W)  =  (U  +  V)  +  W; 

4.  There  is  a  vector  0  in  A  such  that  U  +  0  =  U; 

5.  There  is  a  vector  — U  in  A  such  that  U  +  (—IT)  =  0; 

6.  rUe  A; 

7.  r(U  +  V)  =  rU+  rV; 

8.  (r  +  s)U  =  rV+sV; 

9.  r(s\T)  =  (rs)V; 

10.  1U  =  U.  ■ 

We  say  that  A  is  closed  under  vector  addition  if  (1)  is  true,  and  that  A  is  closed  under 
scalar  multiplication  if  (6)  is  true.  It  can  be  shown  that  if  B  is  any  nonempty  subset  of  A 
that  is  closed  under  vector  addition  and  scalar  multiplication,  then  B  together  with  these 
operations  is  itself  a  vector  space.  (See  any  linear  algebra  text  for  the  proof.)  We  say  that 
B  is  a  subspace  of  A. 

Definition  8.1.3  A  normed  vector  space  is  a  vector  space  A  together  with  a  real-valued 
function  N  defined  on  A,  such  that  if  u  and  v  are  arbitrary  vectors  in  A  and  a  is  a  real 
number,  then 

(a)  N(u)  >  0  with  equality  if  and  only  if  u  =  0; 

(b)  N(au)  =  \a\N(u); 

(c)  N(u  +  v)  <  N(u)  +  N(v). 

We  say  that  N  isa  norm  on  A,  and  (A,  N)  is  a  normed  vector  space. 
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Theorem  8.1.4  If  (A,  N)  is  a  normed  vector  space,  then 

p(x,y)  =  N(x-y)  (8.1.1) 

is  a  metric  on  A . 

Proof  From  (a)  with  u  =  x  —  y,  p(x,  y)  =  N(x  —  y)  >  0,  with  equality  if  and  only 
if  x  =  y.  From  (b)  with  u  =  x  —  y  and  a  =  —  1, 

p{y,  x)  =  N(y  -  x)  =  N(-(x  -  y))  =  N(x  -  y)  =  p(x,  y). 

From  (c)  with  u  =  x  —  z  and  v  =  z.  —  y, 

p(x,  y)  =  N(x  —  y)  <  N(x  -  z.)  +  N(z  -  y)  =  p(x,  z.)  +  p{z,  y).  u 

We  will  say  that  the  metric  in  (8.1.1)  is  induced  by  the  norm  N .  Whenever  we  speak  of 
a  normed  vector  space  (A,  N),  it  is  to  be  understood  that  we  are  regarding  it  as  a  metric 
space  (A,  p),  where  p  is  the  metric  induced  by  N. 

We  will  often  write  N(u)  as  \\u\\.  In  this  case  we  will  denote  the  normed  vector  space  as 

Theorem  8.1.5  If  x  and  y  are  vectors  in  a  normed  vector  space  (A,  N),  then 

\N(x)-N(y)\<N(x-y).  (8.1.2) 

Proof  Since 

x  =  y  +  (x  -  y), 
Definition  8.1.3(c)  with  u  =  y  and  v  =  x  —  y  implies  that 

N(x)  <  N(y)  +  N(x  -  y), 

or 

Nix)  -  N{y)  <  N{x  -  y). 

Interchanging  x  and  y  yields 

N(y)  -  N(x)  <  N(y  -  x). 

Since  N(x  —  y)  =  N(y  —  x)  (Definition  8.1.3(b)  with  u  =  x  —  y  and  a  =  —  1),  the  last 
two  inequalities  imply  (8.1.2).  H 

Metrics  for  R" 

In  Section  5.1  we  defined  the  norm  of  a  vector  X  =  (x\,  x-i, . . . ,  x„)  in  W  as 
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The  metric  induced  by  this  norm  is 

In  \  1/2 

p(X,Y)=\J2(xi-yi)2j  . 

Whenever  we  write  W  without  identifying  the  norm  or  metric  specifically,  we  are  referring 
to  R"  with  this  norm  and  this  induced  metric. 

The  following  definition  provides  infinitely  many  norms  and  metrics  on  M" . 

Definition  8.1.6  If  p  >  1  and  X  =  (xi,  x2, . . . ,  xn),  let 

In  \UP 
IIXIU  =  (X>r  j      •  (8-1-3) 

The  metric  induced  on  M"  by  this  norm  is 

Up 


Pp(X,Y)=^\Xi-y,\^j 


To  justify  this  definition,  we  must  verify  that  (8.1.3)  actually  defines  a  norm.  Since  it  is 
clear  that  \\\\\p  >  0  with  equality  if  and  only  if  X  =  0,  and  ||<zX||p  =  \a\  \\X\\P  if  a  is  any 
real  number  and  XeR",  this  reduces  to  showing  that 

l|X  +  Y||,  <  ||X||P+  ||Y||,  (8.1.4) 

for  every  X  and  Y  in  W .  Since 

\xi+yt  \  <  \xi\  +  \yi\, 

summing  both  sides  of  this  equation  from  i  =  1  to  n  yields  (8.1.4)  with  p  =  1.  To  handle 
the  case  where  p  >  1,  we  need  the  following  lemmas.  The  inequality  established  in  the 
first  lemma  is  known  as  Holder's  inequality. 

Lemma  8.1.7  Suppose  that  /xi ,  [i-i, . . . ,  fin  andv\,  v%,...,  vn  are  nonnegative  numbers. 
Let  p  >  1  andq  =  p/(p  —  1);  thus, 

1  1 

-  +  -  =  1.  (8.1.5) 
p  q 

Then 

Up  i  n      \  lli 


n  In  \lIPIn 

X>m<(X>fJ     (£v?J     •  (8.1.6) 


1=1  \!=1         /  \!=1 

Proof   Let  a  and  /3  be  any  two  positive  numbers,  and  consider  the  function 

ap  R" 
f(P)  =  —  +  —  -ctP, 

p  q 
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where  we  regard  a  as  a  constant.  Since  f'(fi)  =  Pq  1  —  a  and  f"(P)  =  (q  —  i)Pq  2  >  0 
for  P  >  0,  /  assumes  its  minimum  value  on  [0,  oo)  at  P  =  a1^-1'  =  ap~1.  But 


f{<*p-1) 


a 

 h 


-aP  =aP  ( I  +  1-1  ]  =  o. 


Therefore, 


Now  let 


From  (8.1.7), 


a"  Bq 
aft  < —  +  —    if  a,p>0. 
P  q 


(8.1.7) 


-i/p 


-1/9 


(Xi  = 


I E  i1" 


and     Pi  =  Vi  (  ^2  vj 


OLiPi 


jl 


pin 


gin 


Etf    +v-t  E^: 


From  (8.1.5),  summing  this  from  i  =  1  to  «  yields  5Z?=i  <  1,  which  implies  (8.1.6). 
TJ 

Lemma  8.1.8  (Minkowski's  Inequality)  Suppose  that u\,ui,...,  un  andv\, 
U2, ... ,  v„  are  nonnegative  numbers  and  p  >  1.  T/zen 


i//> 


E"f    +  IX 


V(=l 


V;=l 


V;=l 


Proof   Again,  let  ^  =  p/(p  —  1).  We  write 


/'  =  ! 


i  =  l 


i  =  l 


From  Holder's  inequality  with /z,  =  m,  and  v,  =  (w,  +  Vi  )p  , 


1/9 


£uj(u<  +  <    E"f  E<"<'  +  V'F 


(8.1.8) 


(8.1.9) 


(8.1.10) 


(=i 


\(=i 


\(=i 


since  q(p  —  1)  =  p.  Similarly, 


i/p 


1/9 


7=1 


\/=l 


\/=l 


This,  (8.1.9),  and  (8.1.10)  imply  that 

\  1/.P       /  n        \  !//>' 

X>f    +  E»f 


(=i 


1/9 


(  =  1 
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Since  1  —  1/q  =  1/ p,  this  implies  (8.1.8),  which  is  known  as  Minkowski's  inequality.  H 
We  leave  it  to  you  to  verify  that  Minkowski's  inequality  implies  (8. 1 .4)  if  p  >  1 . 
We  now  define  the  oo-norm  on  W  by 

||X||oo  =  max{|x,  |  |  1  <  i  <  n)  .  (8.1.11) 

We  leave  it  to  you  to  verify  (Exercise  8.1.15)  that  ||  ■  ||oo  is  a  norm  on  R".  The  associated 
metric  is 

Poo(X,  Y)  =  max  [\xt  -  yi  \  |  1  <  i  <  n) . 
The  following  theorem  justifies  the  notation  in  (8.1.11). 

Theorem  8.1.9  IfX  e  R"  and  p2  >  pi  >  1,  then 

\\Mp2<\\Mpi\  (8-1.12) 


moreover, 


lim  ||X||p  =  max{|x/|  |  1  <  i  <  n) .  (8.1.13) 


Proof   Let  U\,  112,  ...,unbe  nonnegative  and  M  =  max  {«,  |  1  <  i  <  71} .  Define 

/  n  \1/P 


Since  u\jo{p)  <  1  and  p2  >  pi, 


\i  =  l 


therefore, 


(  Ui  V  >  ( 
\<y(P2)J    ~  \<y(p 


P\  \ 11  px      (  »    /         \Pz\  1/pi 


=  1, 


soa(pi)  >  <j(p2).  Since  M  <  a(p)  <  Mnllp,  \\mp^00a(p)  =  M.  Letting  w,-  =  \xt\ 
yields  (8.1.12)  and  (8.1.13).  H 

Since  Minkowski's  inequality  is  false  if  p  <  1  (Exercise  8.1.19),  (8.1.3)  is  not  a  norm  in 
this  case.  However,  if  0  <  p  <  1,  then 


i  =  l 

is  a  norm  on  R"  (Exercise  8.1.20). 

Vector  Spaces  of  Sequences  of  Real  Numbers 

In  this  section  and  in  the  exercises  we  will  consider  subsets  of  the  vector  space  R°°  con- 
sisting of  sequences  X  =  {xi}??_l,  with  vector  addition  and  scalar  multiplication  defined 

by 

X  +  Y  =  {xi+yi}<?±1    and     rX  =  {rxi}fLl. 
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Example  8.1.3  Suppose  that  1  <  p  <  oo  and  let 

X  e  M°°  |  I 

Let 


wi,  =  f£>i'J 


l/p 


\i  =  l 

Show  that  (lp,  ||  ■  Hp)  is  a  normed  vector  space. 

Solution  Suppose  that  X,  Y  e  lp.  From  Minkowski's  inequality, 

(n  \lIP         /  n  \l/P/n  \  lIP 

for  each  n.  Since  the  right  side  remains  bounded  as  n  — >  oo,  so  does  the  left,  and 

/  oo  \l/P       /  oo  \  VP       /  oo  \  VP 

X>+j«r    -En*    +  (l>i')  '  (8-L14) 


\i  =  \  /  \i  =  l  /  \i'=l 


so  X  +  Y  €  lp.  Therefore,  lp  is  closed  under  vector  addition.  Since  lp  is  obviously  closed 
under  scalar  multiplication,  lp  is  a  vector  space,  and  (8.1.14)  implies  that  ||  ■  \\p  is  a  norm 
on^p.  ■ 

The  metric  induced  by  ||  •  \\p  is 

(oo  \  1Ip 

J2\xt-yi\p)  ■ 

Henceforth,  we  will  denote  (lp,  \\  ■  \\p)  simply  by  lp. 
Example  8.1.4  Let 

lx  =  {X  €  M°°  |  {xt}ft i  is  bounded}  . 

Let 

||X||oo  =  SUp{|x,|  |/  >  l}. 

We  leave  it  to  you  (Exercise  8.1.26)  to  show  that  (loo,  II  •  II oo)  is  a  normed  vector  space.  ■ 
The  metric  induced  by  ||  ■  Woo  is 

Poo(X,  Y)  =  sup  {\xt  -  yi\ |  i  >  1}  . 
Henceforth,  we  will  denote  (too,  II  ■  II oo)  simply  by  1^. 
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Familiar  Definitions  and  Theorems 

At  this  point  you  may  want  to  review  Definition  1.3.1  and  Exercises  1.3.6  and  1.3.7,  which 
apply  equally  well  to  subsets  of  a  metric  space  (A,  p). 

We  will  now  state  some  definitions  and  theorems  for  a  general  metric  space  (^4,  p)  that 
are  analogous  to  definitions  and  theorems  presented  in  Section  1.3  for  the  real  numbers.  To 
avoid  repetition,  it  is  to  be  understood  in  all  these  definitions  that  we  are  discussing  a  given 
metric  space  (A,  p). 

Definition  8.1.10  If  Uo  e  A  and  e  >  0,  the  set 

Ne(uo)  =  {w  e  A  |  p(uo,  u)  <  e} 

is  called  an  € -neighborhood  of  uq  .  (Sometimes  we  call  S€  the  open  ball  of  radius  e  centered 
at  Mo-)  If  a  subset  S  of  A  contains  an  e-neighborhood  of  uo,  then  S  is  a  neighborhood  of 
uq,  and  Mo  is  an  interior  point  of  S.  The  set  of  interior  points  of  5  is  the  interior  of  S, 
denoted  by  S°.  If  every  point  of  S  is  an  interior  point  (that  is,  5°  =  5),  then  5  is  open.  A 
set  S  is  closed  if  Sc  is  open. 

Example  8.1.5  Show  that  if  r  >  0,  then  the  open  ball 

Sr(uo)  =  {u  e  A  \  p(uo,  u)  <  r} 

is  an  open  set. 


Solution  We  must  show  that  if  u\  e  Sr(uo),  then  there  is  an  e  >  0  such  that 

Se(ui)  C  Sr(u0).  (8.1.15) 

If  Mi  e  SV(wo),  then  p(ui,  wo)  <  r.  Since 

p(u,  u0)  <  p(u,  Mi)  +  p{u\,  Uo) 

for  any  u  in  A,  p(u,uo)  <  >"  if  p{u,u\)  <  r  —  p(u\,Uo)-  Therefore,  (8.1.15)  holds  if 
e  <  r  —  p(ui,  u0).  ■ 

The  entire  space  A  is  open  and  therefore  0  (=  Ac)  is  closed.  However,  0  is  also  open, 
for  to  deny  this  is  to  say  that  it  contains  a  point  that  is  not  an  interior  point,  which  is  absurd 
because  0  contains  no  points.  Since  0  is  open,  A  (=  0C)  is  closed.  If  A  =  R,  these  are  the 
only  sets  that  are  both  open  and  closed,  but  this  is  not  so  in  all  metric  spaces.  For  example, 
if  p  is  the  discrete  metric,  then  every  subset  of  A  is  both  open  and  closed.  (Verify!) 

A  deleted  neighborhood  of  a  point  uq  is  a  set  that  contains  every  point  of  some  neigh- 
borhood of  uo  except  uo  itself.  (If  p  is  the  discrete  metric  then  the  empty  set  is  a  deleted 
neighborhood  of  every  member  of  Al) 

The  proof  of  the  following  theorem  is  identical  to  the  proof  Theorem  1.3.3. 
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Theorem  8.1.11 

(a)  The  union  of  open  sets  is  open. 

(b)  The  intersection  of  closed  sets  is  closed. 

Definition  8.1.12  Let  5  be  a  subset  of  A.  Then 

(a)  Mo  is  a  limit  point  of  S  if  every  deleted  neighborhood  of  uq  contains  a  point  of  S . 

(b)  uq  is  a  boundary  point  of  S  if  every  neighborhood  of  uq  contains  at  least  one  point 
in  5  and  one  not  in  5.  The  set  of  boundary  points  of  5  is  the  boundary  of  S,  denoted 
by  dS.  The  closure  of  S,  denoted  by  S,  is  denned  by  5  =  S  U  35. 

(c)  Mo  is  an  isolated  point  of  S  if  Mo  €  S  and  there  is  a  neighborhood  of  Mo  that  contains 
no  other  point  of  5 . 

(d)  Mo  is  exterior  to  S  if  Mo  is  in  the  interior  of  Sc .  The  collection  of  such  points  is  the 
exterior  of  5.  ■ 

Although  this  definition  is  identical  to  Definition  1.3.4,  you  should  not  assume  that  con- 
clusions valid  for  the  real  numbers  are  necessarily  valid  in  all  metric  spaces.  For  example, 
if  A  =  K  and  p(u,  v)  =  \u  —  v\,  then 

Sr(u0)  =  [u  |  p(u,  w0)  <  r)  . 

This  is  not  true  in  every  metric  space  (Exercise  8.1.6). 

For  the  proof  of  the  following  theorem,  see  the  proofs  of  Theorem  1.3.5  and  Corol- 
lary 1.3.6. 

Theorem  8.1.13  A  set  is  closed  if  and  only  if  it  contains  all  its  limit  points. 


Completeness 

Since  metric  spaces  are  not  ordered,  concepts  and  results  concerning  the  real  numbers  that 
depend  on  order  for  their  definitions  must  be  redefined  and  reexamined  in  the  context  of 
metric  spaces.  The  first  example  of  this  kind  is  completeness.  To  discuss  this  concept,  we 
begin  by  defining  an  infinite  sequence  (more  briefly,  a  sequence)  in  a  metric  space  {A,  p)  as 
a  function  defined  on  the  integers  n  >  k  with  values  in  A.  As  we  did  for  real  sequences,  we 
denote  a  sequence  in  A  by,  for  example,  {un}  =  {un}^=!c.  A  subsequence  of  a  sequence 
in  A  is  defined  in  exactly  the  same  way  as  a  subsequence  of  a  sequence  of  real  numbers 
(Definition  4.2.1). 

Definition  8.1.14  A  sequence  {u„\  in  a  metric  space  (A,  p)  converges  to  u  e  A  if 

lim  p(un,u)  =  0.  (8.1.16) 

n — ^oo 

In  this  case  we  say  that  lim„^oo  un  =  u.  ■ 

We  leave  the  proof  of  the  following  theorem  to  you.  (See  the  proofs  of  Theorems  4.1.2 
and  4.2.2.) 
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Theorem  8.1.15 

(a)  The  limit  of  a  convergent  sequence  is  unique. 

(b)  //lim„ 

->oo  un  =  M>  then  every  subsequence  of{un}  converges  to  u. 

Definition  8.1.16  A  sequence  {un}  in  a  metric  space  (^4,  p)  is  a  Cauchy  sequence  if 
for  every  e  >  0  there  is  an  integer  N  such  that 

p(un,um)<e    and     m,n>N.  (8.1.17) 


We  note  that  if  p  is  the  metric  induced  by  a  norm  ||  ■  ||  on  A,  then  (8.1.16)  and  (8.1.17) 
can  be  replaced  by 

lim  \\u„  —  u\\  =  0 

n— >-oo 

and 

Hun  —  «m  II  <  e    and    m,  n  >  N, 

respectively. 

Theorem  8.1.17  If  a  sequence  {un}  in  a  metric  space  (A,  p)  is  convergent,  then  it  is 
a  Cauchy  sequence. 

Proof  Suppose  that  limn^oo  un  =  u.  If  e  >  0,  there  is  an  integer  N  such  that 
p(u„,  u)  <  e/2  if  n  >  N .  Therefore,  if  m,  n  >  N,  then 

p(u        )  <  p(u„,u)  +  p(u,um)  <  e.  JJ 


Definition  8.1.18  A  metric  space  (A,p)  is  complete  if  every  Cauchy  sequence  in  A 
has  a  limit. 

Example  8.1.6  Theorem  4.1.13  implies  that  the  set  K  of  real  numbers  with  p(u,  v) 
=  |  u  —  v  |  is  a  complete  metric  space.  ■ 

This  example  raises  a  question  that  we  should  resolve  before  going  further.  In  Section  1.1 
we  defined  completeness  to  mean  that  the  real  numbers  have  the  following  property: 

Axiom  (I).  Every  nonempty  set  of  real  numbers  that  is  bounded  above  has  a  supremum. 

Here  we  are  saying  that  the  real  numbers  are  complete  because  every  Cauchy  sequence 
of  real  numbers  has  a  limit.  We  will  now  show  that  these  two  usages  of  "complete"  are 
consistent. 

The  proof  of  Theorem  4.1.13  requires  the  existence  of  the  (finite)  limits  inferior  and 
superior  of  a  bounded  sequence  of  real  numbers,  a  consequence  of  Axiom  (I).  However, 
the  assertion  in  Axiom  (I)  can  be  deduced  as  a  theorem  if  Axiom  (I)  is  replaced  by  the 
assumption  that  every  Cauchy  sequence  of  real  numbers  has  a  limit.  To  see  this,  let  T  be  a 
nonempty  set  of  real  numbers  that  is  bounded  above.  We  first  show  that  there  are  sequences 
{m        and  {v,        with  the  following  properties  for  all  i  >  1: 
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(a)  Ui  <  t  for  some  t  €  T  and  u,  >  t  for  all  t  e  T; 

(b)  (u;-m;)<2'"1(^i-"i)- 

(c)  M;  <  Ui  +  l  <      +  1  <  Vj 

Since  7"  is  nonempty  and  bounded  above,  u\  and  v\  can  be  chosen  to  satisfy  (a)  with 
i  =  1.  Clearly,  (b)  holds  with  i  =  1.  Let  w\  =  (u\  +  v\)/2,  and  let 


("2,  v2) 


{w\,v\)  if  w\  <  t  for  some  t  e  7", 
(wi,u)i)     if  u)i  >  f  for  all  t  e  7\ 


In  either  case,  (a)  and  (b)  hold  with  i  =  2  and  (c)  holds  with  i  =  1.  Now  suppose  that 
n  >  1  and  {ui, . . . ,  Mn}  and  {vi, . . . ,  vn}  have  been  chosen  so  that  (a)  and  (b)  hold  for 
I  <  i  <  n  and  (c)  holds  for  1  <  i  <  n  —  1.  Let  wn  =  (un  +  v„  )/2  and  let 


{Un+l,  Vn+l)  = 


(wn,  v„)  ifwn<t  for  some  t  e  T , 
(un ,  wn)     if  wn  >  /  for  all  ?  e  7". 


Then  (a)  and  (b)  hold  for  1  <  i  <  n  +  1  and  (c)  holds  for  1  <  i  <  n.  This  completes 
the  induction. 

Now  (b)  and  (c)  imply  that 

0  <  w;+i  —  m  <  2'~1(vi  —  Mi)    and     0  <  Vi  —  Uj+i  <  2'~1(vi  —  ui),     i  >  1. 

By  an  argument  similar  to  the  one  used  in  Example  4.1.14,  this  implies  that  {m,  },0^  and 
{u/}^!  m'e  Cauchy  sequences.  Therefore  the  sequences  both  converge  (because  of  our 
assumption),  and  (b)  implies  that  they  have  the  same  limit.  Let 

lim  Ui  =  lim  Vi  =  ft. 

i —>oo  i—too 

If  t  e  T,  then  i>,  >  t  for  all  i,  so  ji  =  lim^oo  Vj  >  t;  therefore,  ft  is  an  upper  bound  of 
T .  Now  suppose  that  e  >  0.  Then  there  is  an  integer  such  that  >  /3  —  e.  From  the 
definition  of  un,  there  is  a  tjy  in  T  such  that      >  un  >  P  —  e-  Therefore,  ft  =  sup  T.  H 

Example  8.1.7  (The  Metric  Space  C [a.  b])  Let  C [a,  b]  denote  the  set  of  all 
real-valued  functions  /  continuous  on  the  finite  closed  interval  [a,b].  From  Theorem  2.2.9, 
the  quantity 

H/ll  =max{|/(x)||a<*<&} 

is  well  defined.  We  leave  it  to  you  to  verify  that  it  is  a  norm  on  C  [a ,  b] .  The  metric  induced 
by  this  norm  is 

P(Lg)  =  =  max {!/(•*) -#C*)I  \  a  <  x  <  b) . 

Whenever  we  refer  to  C[a,b],  we  mean  this  metric  space  or,  equivalently,  this  normed 
linear  space.  ■ 

From  Theorem  4.4.6,  a  Cauchy  sequence  {/„}  in  C[a,  b]  converges  uniformly  to  a  func- 
tion /  on  [a,  b],  and  Corollary  4.4.8  implies  that  /  is  in  C  [a,  b\,  hence,  C[a,b]  is  complete. 
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The  Principle  of  Nested  Sets 

We  say  that  a  sequence  {7^}  of  sets  is  nested  if  Tn+i  C  Tn  for  all  n. 

Theorem  8.1.19  (The  Principle  of  Nested  Sets)  A  metric  space  (A,  p)  is 
complete  if  and  only  if  every  nested  sequence  {Tn}  of  nonempty  closed  subsets  of  A  such 
that  linin-^oo  d(T„ )  =  0  has  a  nonempty  intersection. 

Proof  Suppose  that  (A,  p)  is  complete  and  {Tn}  is  a  nested  sequence  of  nonempty 
closed  subsets  of  A  such  that  lim„^oo  d(T„)  =  0.  For  each  n,  choose  tn  e  Tn.  If  m  >  n, 
then  tm,  tn  e  Tn,  so  p(tn,  tm)  <  d(Tn).  Since  limn-xx,  d{Tn)  =  0,  {t„}  is  a  Cauchy  se- 
quence. Therefore,  lim„^oo  tn  =  7  exists.  Since  7  is  a  limit  point  of  Tn  and  Tn  is  closed 
for  all  n,teT„  for  all  n.  Therefore,  7  e  n£Lj  T„;  in  fact,  n^x  r„  =  {7}.  (Why?) 

Now  suppose  that  (A,  p)  is  not  complete,  and  let  {?„}  be  a  Cauchy  sequence  in  A  that 
does  not  have  a  limit.  Choose  n\  so  that  p(fn,tni)  <  1/2  if  n  >  «i,  and  let  7i  = 
{t  |  p(t,tni)  <  l}.  Now  suppose  that  j  >  1  and  we  have  specified  m,  n%,  nj-i 
and  7\,  Ti,        7}-i.  Choose  «y  >  so  that  p(t„,tnj)  <  2~-*  if  ;i  >  rij,  and  let 

=  \t  |  p(t,tnj)  <  2_7  +  1}.  Then  7y  is  closed  and  nonempty,  7y+i  C  7}  for  all  y, 
and  limy-^oo  d(Tj)  =  0.  Moreover,  f„  e  Tj  if «  >  «y.  Therefore,  if  7  €  PI^LjTy, 
then  p(t„,T)  <  2~-/,  «  >  «/,  so  lim„^oo  f„  =  7,  contrary  to  our  assumption.  Hence, 

n£L17}  =  0. 
Equivalent  Metrics 

When  considering  more  than  one  metric  on  a  given  set  A  we  must  be  careful,  for  example, 
in  saying  that  a  set  is  open,  or  that  a  sequence  converges,  etc.,  since  the  truth  or  falsity  of 
the  statement  will  in  general  depend  on  the  metric  as  well  as  the  set  on  which  it  is  imposed. 
In  this  situation  we  will  alway  refer  to  the  metric  space  by  its  "full  name;"  that  is,  (A,  p) 
rather  than  just  A. 

Definition  8.1.20  If  p  and  a  are  both  metrics  on  a  set  A,  then  p  and  a  are  equivalent 
if  there  are  positive  constants  a  and  /?  such  that 

a<^—^-<P    forall     x,y&A    such  that    x  ^  y.  (8.1.18) 
a(x,y) 

Theorem  8.1.21  If  p  and  a  are  equivalent  metrics  on  a  set  A,  then  (A,  p)  and  (A,  a) 

have  the  same  open  sets. 

Proof  Suppose  that  (8.1.18)  holds.  Let  S  be  an  open  set  in  (A,  p)  and  let  xq  e  S.  Then 
there  is  an  e  >  0  such  that  x  e  5  if  p(x,  xq)  <  e,  so  the  second  inequality  in  (8.1.18) 
implies  that  xq  e  S  if  a(x,  xq)  <  e/ /3.  Therefore,  S  is  open  in  (A,  a). 

Conversely,  suppose  that  S  is  open  in  (A,  a)  and  let  xq  e  S .  Then  there  is  an  e  >  0 
such  that  x  e  5  if  a(x,  xo)  <  e,  so  the  first  inequality  in  (8.1.18)  implies  that  Xo  e  S  if 
p(x,  xq)  <  ea.  Therefore,  S  is  open  in  (^4,  p).  H 
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Theorem  8.1.22  Any  two  norms  N\  and  N2  onW  induce  equivalent  metrics  on  W . 

Proof   It  suffices  to  show  that  there  are  positive  constants  a  and  f)  such 

Ni  (X) 

<x<^j^-<P    if    X^O.  (8.1.19) 

We  will  show  that  if  N  is  any  norm  on  R",  there  are  positive  constants  and  such 
that 

«jv||X||2  <  N(X)  <  bN\\X\\2    if    X^O  (8.1.20) 
and  leave  it  to  you  to  verify  that  this  implies  (8.1.19)  with  a  =  dN1/bN2  and  P  =  bN1/aN2- 
We  write  X  —  Y  =  (jci,  Xi, . . . ,  xn)  as 


(=1 

where  E;  is  the  vector  with  i  th  component  equal  to  1  and  all  other  components  equal  to  0. 
From  Definition  8.1.3(b),  (c),  and  induction, 

n 

N(x-Y)<J2\xi-yi\Nm; 
7=1 

therefore,  by  Schwarz's  inequality, 

N(X- Y)  <  K\\X- Y||2,  (8.1.21) 

where 


(n  s 


1/2 


From  (8.1.21)  and  Theorem  8.1.5, 

\N(X)-N(Y)\  <  AT||X  —  Y||2, 

so  N  is  continuous  on  =  M".  By  Theorem  5.2.12,  there  are  vectors  Ui  and  U2  such 
that  HU1H2  =  HU2II2  =  1, 

N(Vi)  =  min  {N(V)  \  ||U||2  =  1}  ,    and    N(V2)  =  max  {N(V)  \  ||U||2  =  1} . 

If  on  =  N(Ui)  and  /5jv  =  N(V2),  then  a  n  andbw  are  positive  (Definition  8.1.3(a)),  and 

aN  <  N  ( — —  )  <  bN    if  X^O. 


This  and  Definition  8.1.3(b)  imply  (8.1.20).  TJ 
We  leave  the  proof  of  the  following  theorem  to  you. 
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Theorem  8.1.23  Suppose  that  p  and  a  are  equivalent  metrics  on  A.  Then 

(a)  A  sequence  {u„}  converges  to  u  in  (A,  p)  if  and  only  if  it  converges  to  u  in  {A,  a). 

(b)  A  sequence  {un  }  is  a  Cauchy  sequence  in  (A,  p)  if  and  only  if  it  is  a  Cauchy  sequence 
in  (A,  a). 

(c)  (A,  p)  is  complete  if  and  only  if  {A,  a)  is  complete. 


8.1  Exercises 


1.  Show  that  (a),  (b),  and  (c)  of  Definition  8.1.1  are  equivalent  to 

(i)    p(w,  v)  =  0  if  and  only  if  u  =  v, 
(ii)   p(u,v)  <  p(w,u)  +  p(w,v). 

2.  Prove:  If  x,  y,  u,  and  v  are  arbitrary  members  of  a  metric  space  (A,  p),  then 

\p(x,  y)  -  p(u,  v)\  <  p(x,  u)  +  p(v,  y). 

3.  (a)    Suppose  that  (A,  p)  is  a  metric  space,  and  define 

p(u,  v) 


pi(u,  v) 


1  +  p(u,  v) 


Show  that  (A,  pi)  is  a  metric  space, 
(b)    Show  that  infinitely  many  metrics  can  be  defined  on  any  set  A  with  more  than 
one  member. 

Let  (A,  p)  be  a  metric  space,  and  let 

p(w,  v) 


o(u,  v) 


1  +  p(w,  v) 

Show  that  a  subset  of  A  is  open  in  {A,  p)  if  and  only  if  it  is  open  in  (A,  a). 
Show  that  if  A  is  an  arbitrary  nonempty  set,  then 


p(u,  v) 


0  if  v  =  u, 

1  if  v  ^  u, 


is  a  metric  on  A. 
6.     Suppose  that  (A,  p)  is  a  metric  space,  uq  e  A,  and  r  >  0. 

(a)  Show  that  Sr(wo)  C  [u  |  p(w,  wo)  <  r  ]  if  A  contains  more  than  one  point. 

(b)  Verify  that  if  p  is  the  discrete  metric,  then  Si(wo)  ^  {u  \  P(M>  Mo)  <  l}- 
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7.  Prove: 

(a)  The  intersection  of  finitely  many  open  sets  is  open. 

(b)  The  union  of  finitely  many  closed  sets  is  closed. 

8.  Prove: 

(a)  If  U  is  a  neighborhood  of  w0  and  U  C  V,  then  V  is  a  neighborhood  of  m0. 

(b)  If  U\,  U2,  ■  ■  ■ ,  U„  are  neighborhoods  of  Uo,  so  is  n"=1£/,\ 

9.  Prove:  A  limit  point  of  a  set  S  is  either  an  interior  point  or  a  boundary  point  of  S. 

10.  Prove:  An  isolated  point  of  S  is  a  boundary  point  of  Sc . 

11.  Prove: 

(a)  A  boundary  point  of  a  set  S  is  either  a  limit  point  or  an  isolated  point  of  S. 

(b)  A  set  S  is  closed  if  and  only  if  S  =  S. 

12.  Let  S  be  an  arbitrary  set.  Prove:  (a)  3S  is  closed,  (b)  5°  is  open,  (c)  The  exterior 
of  S  is  open,  (d)  The  limit  points  of  5  form  a  closed  set.  (e)  (S)  =  S. 

13.  Prove: 

(a)  (Si  n  52)°  =  S°  n  S2°  (b)  5°  U  S°  c  (Si  U  S2)° 

14.  Prove: 

(a)  3(5i  U  S2)  C  dSi  U  9S2  (b)  3(5i  n  S2)  C  3Si  U  3S2 

(c)  3S  c  3S  (d)  3S  =  3SC 
(e)  d(S-T)  cdSUdT 

15.  Show  that 

||X||  =max{|x1|,|x2|,...,|x„|} 

is  a  norm  on  W . 

16.  Suppose  that  (A{ ,  p;  ),  1  <  i  <     are  metric  spaces.  Let 

A  =  A\  x  A2  x  •  •  •  x  Ak  =  {X  =  (xi,  X2,  ■  ■  ■ ,  Xk)  \  x\  e  Aj,  1  <  i  <  k)  . 
If  X  and  Y  are  in  A,  let 

k 

p(X,Y)  =  j,). 

i  =  l 

(a)    Show  that  p  is  a  metric  on  A. 
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(b)  Let  {XrJ^j  =  {(xir,  x2r,  ■  ■  ■ ,  Xkr)}^Li  ^e  a  sequence  in  A.  Show  that 

lim  Xr  =  X  =  (jci,  x2,  •  •  •  ,  x/t) 

if  and  only  if 

lim  Xir  =Xi,     1  <  i  <k. 

r-±oo 

(c)  Show  that  {Xr}^=1  is  a  Cauchy  sequence  in  L4,  p)  if  and  only  if  {xir  }rxL1  is  a 
Cauchy  sequence  in  (Ai,  p,),  1  <  /  <  fc. 

(d)  Show  that  (4,  p)  is  complete  if  and  only  if  (Aj ,  p;)  is  complete,  1  <  i  <  k. 

17.  For  each  positive  integer  i,  let  (At,  p;)  be  a  metric  space.  Let  A  be  the  set  of  all 
objects  of  the  form  X  =  (xi ,  x2, . . . ,  xn , . . . ),  where  x(-  e  Aj ,  i  >  1.  (For  example, 
if  4,  =  R,  z  >  1,  then  A  =  R°°.)  Let  {a,}?^  be  any  sequence  of  positive  numbers 
such  that  5Z^=i  ai  <  00  ■ 

(a)  Show  that 

p(X,  Y)  =  >   a(-  — —  

\+pi(xi,yi) 

is  a  metric  on  A. 

(b)  Let{Xr}~=1  =  {(xlr  ,  X2r,  •  •  •  ,  Xnr,  .  . 

■  )}T=i  be  a  sequence  in  A.  Show  that 
lim  Xr  =  X  =  (xi ,  x2,  ■  .  . ,  xn, . . . ) 

if  and  only  if 

lim  Xir  =  X{,     i  >  1. 

(c)  Show  that  {Xfj^j  is  a  Cauchy  sequence  in      p)  if  and  only  if  {xir}'^=1  is  a 
Cauchy  sequence  in  (Ai ,  p;  )  for  all  z  >  1. 

(d)  Show  that  (A,  p)  is  complete  if  and  only  if  (Ai ,  p;  )  is  complete  for  all  z  >  1. 

18.  Let  C[0,  oo)  be  the  set  of  all  real-valued  functions  continuous  on  [0,  oo).  For  each 
nonnegative  integer  n,  let 

||/|U=max{|/(x)||0<x<n} 

and 

aN  II  f  ~  S\\n 

g) 


i  +  ll/-slU 

Define 

p(/>  s)  =  X!  ^=r/°»(/>  «■)• 


2" 
«=i 


(a)    Show  that  p  is  a  metric  on  C  [0,  oo). 
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(b)  Let  {fkjTLi  be  a  sequence  of  functions  in  C[0.  oo).  Show  that 

lim  fk  =  f 

k — ^oo 

in  the  sense  of  Definition  8.1.14  if  and  only  if 

lim  fk(x)  =  f(x) 

k — >CG 

uniformly  on  every  finite  subinterval  of  [0,  oo). 

(c)  Show  that  (C  [0,  oo),  p)  is  complete. 

19.  Show  that  Minkowski's  inequality  is  false  if  0  <  p  <  1. 

20.  Suppose  that  0  <  p  <  1.  Show  that  if  u  and  v  are  nonnegative,  then 

(u  +  v)p  <  up  +  vp . 
Use  this  to  show  that  if  X,  Y  e  W , 

n  n 
1=1  i=l 

then 

p(X  +  Y)<p(X)  +  p(Y). 

Is  p  a  norm  on  K"  ? 

21.  Suppose  that  X  =  {xi}f^=1  is  in  lp,  where  p  >  1.  Show  that 

(a)  Xel,  for  all  r  >  p; 

(b)  Ifr  >  /Mhen  ||X||,  <  ||X||P; 

(c)  limr^oo  ||X||r  =  ||X||oo. 

22.  Let  (A,  p)  be  a  metric  space. 

(a)  Suppose  that  {un}  and  {vn}  are  sequences  in  A,  lim,,-^  un  =  u,  and  lim^-xx,  vn 
v.  Show  that  lim„^oo  p(un,  v„)  =  p(u,  v). 

(b)  Conclude  from  (b)  that  if  limn^oo  un  =  u  and  v  is  arbitrary  in  A,  then 
linWoo  p(un,  v)  =  p(u,  v). 

23.  Prove:  If  {ur}'^=1  is  a  Cauchy  sequence  in  a  normed  vector  space  (A,  \\  ■  ||),  then 
{IM}£U  is  bounded. 

24.  Let 


A  =  I  X  e  K°°  |  the  partial  sums      Xi,  n  >  1 ,  are  bounded^  . 
(a)   Show  that 


;  =  1 


|X||  =  sup 


!  =  1 


is  a  norm  on  .4. 

(b)   Let  p(X,  Y)  =  ||X  -  Y|| .  Show  that  (A,  p)  is  complete. 
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Show  that 

ii/ii  =  /  \m\dx 

J  a 

is  a  norm  on  C[a,  b], 
Show  that  the  sequence  {  /„ }  defined  by 

is  a  Cauchy  sequence  in  (C[a,  b],  ||  •  ||). 
(c)    Show  that  (C[a,  b],  ||  ■  ||)  is  not  complete. 

26.  (a)   Verify  that  I  oo  is  a  normed  vector  space, 
(b)    Show  that      is  complete. 

27.  Let  A  be  the  subset  of  R°°  consisting  of  convergent  sequences  X  =  {x,  }f^=v  Define 
||X||  =  sup(>1        Show  that  (^4,  ||  ■  ||)  is  a  complete  normed  vector  space. 

28.  Let  A  be  the  subset  of  R°°  consisting  of  sequences  X  =  {x,  }?^  such  that  lim,  ^oo  x,  = 
0.  Define  ||X||  =  max{|x,|  |  i  >  l}.  Showthat(^4,  ||-||)  is  a  complete  normed  vector 
space. 

29.  (a)    Show  that  R"p  is  complete  if  p  >  1. 
(b)    Show  that  lp  is  complete  if  p  >  1. 

30.  Show  that  if  X  =  {x,}^  €  lp  and  Y  =  {j/}^  e  lq,  where  l/p  +  l/q  =  1, 
then  Z  =  {xtyi}  e  l\. 

8.2  COMPACT  SETS  IN  A  METRIC  SPACE 

Throughout  this  section  it  is  to  be  understood  that  (A,  p)  is  a  metric  space  and  that  the  sets 
under  consideration  are  subsets  of  A. 

We  say  that  a  collection  31  of  open  subsets  of  A  is  an  open  covering  of  T  if  T  C 
U  {H  I  H  e  X}.  We  say  that  T  has  the  Heine-Borel  property  if  every  open  covering  M 
of  T  contains  a  finite  collection  3t  such  that 

T  C  U  j#  \H  € 

From  Theorem  1.3.7,  every  nonempty  closed  and  bounded  subset  of  the  real  numbers 
has  the  Heine-Borel  property.  Moreover,  from  Exercise  1.3.21,  any  nonempty  set  of  reals 
that  has  the  Heine-Borel  property  is  closed  and  bounded.  Given  these  results,  we  defined 
a  compact  set  of  reals  to  be  a  closed  and  bounded  set,  and  we  now  draw  the  following 
conclusion: 

A  nonempty  set  of  real  numbers  has  the  Heine-Borel  property  if  and  only  if  it  is  compact. 


25.  (a) 
(b) 
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The  definition  of  boundedness  of  a  set  of  real  numbers  is  based  on  the  ordering  of  the 
real  numbers:  if  a  and  b  are  distinct  real  numbers  then  either  a  <  b  or  b  <  a.  Since  there 
is  no  such  ordering  in  a  general  metric  space,  we  introduce  the  following  definition. 

Definition  8.2.1  The  diameter  of  a  nonempty  subset  S  of  A  is 

d(S)  =  sup  {p(w,  v)\u,  v  <E  T} . 
If  d(S)  <  oo  then  S  is  bounded.  ■ 

As  we  will  see  below,  a  closed  and  bounded  subset  of  a  general  metric  space  may  fail 
to  have  the  Heine-Borel  property.  Since  we  want  "compact"  and  "has  the  Heine-Borel 
property"  to  be  synonymous  in  connection  with  a  general  metric  space,  we  simply  make 
the  following  definition. 

Definition  8.2.2  A  set  T  is  compact  if  it  has  the  Heine-Borel  property. 

Theorem  8.2.3  An  infinite  subset  T  of  A  is  compact  if  and  only  if  every  infinite  subset 
of  T  has  a  limit  point  in  T. 

Proof  Suppose  that  T  has  an  infinite  subset  E  with  no  limit  point  in  T .  Then,  if  t  e  T , 
there  is  an  open  set  Ht  such  that  t  e  Ht  and  Ht  contains  at  most  one  member  of  E.  Then 
M  =  U  [Ht  1 1  e  7}  is  an  open  covering  of  T,  but  no  finite  collection  {Htl ,  Ht2, . . . ,  Htk) 
of  sets  from  jf  can  cover  E,  since  E  is  infinite.  Therefore,  no  such  collection  can  cover  T; 
that  is,  T  is  not  compact. 

Now  suppose  that  every  infinite  subset  of  T  has  a  limit  point  in  T,  and  let  3t  be  an  open 
covering  of  T .  We  first  show  that  there  is  a  sequence  {Hi }~  j  of  sets  from  3t  that  covers 
T. 

If  €  >  0,  then  T  can  be  covered  by  e -neighborhoods  of  finitely  many  points  of  T .  We 
prove  this  by  contradiction.  Let  t\  e  T.  If  Ne(t\)  does  not  cover  T,  there  is  a  f2  €  7  such 
that  p{h,t2)  >  €.  Now  suppose  that  n  >  2  and  we  have  chosen  fi,  f2,  . . . ,  £„  such  that 
(/')  >  f,  1  S  i  <  7  <  «•  If  U"_1Are(fi)  does  not  cover  7",  there  is  a  e  7"  such 
that  p(ti,t„+i)  >  €,  1  <  i  <  n.  Therefore,  p(ti,tj)  >  e,  1  <  i  <  j  <  n  +  1.  Hence, 
by  induction,  if  no  finite  collection  of  e-neighborhoods  of  points  in  T  covers  T,  there  is  an 
infinite  sequence  {t„}™=1  in  7"  such  that  p(ti,tj)  >  e,  i  ^  7.  Such  a  sequence  could  not 
have  a  limit  point,  contrary  to  our  assumption. 

By  taking  e  successively  equal  to  1,  1/2,  . . . ,  l/n,  we  can  now  conclude  that,  for 
each  n,  there  are  points  t\n,  f2«,  ■  •  • ,  tk„,n  sucn  that 

TC  \jNl/n(tin). 

i  =  l 

Denote  5;„  =  jVi/„(f,„),  1  <  i  <  n,  n  >  1,  and  define 


{Gl,  G2,  G3,  ...}  =  {fill,  .  .  .  ,  Bku\,  B12,  .  .  .  ,  fiyfc2,2,  #13,  •  •  •  ,  #yfc3,3,  •  •  •}• 
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If  t  e  T,  there  is  an  H  in  M  such  that  t  &  H .  Since  H  is  open,  there  is  an  e  >  0  such 
that  /Ve(f )  C  H .  Since  f  e  Gj  for  infinitely  many  values  of  j  and  lim/^oo  d(G /)  =  0, 

Gy  C  JVe(0  C  # 

for  some  y .  Therefore,  if  {Gyj  is  the  subsequence  of  {Gj}  such  that  Gyj  is  a  subset  of 
some  Hi  in  <5f  (the  {//, }  are  not  necessarily  distinct),  then 

oo 

Tc\jHh  (8.2.1) 
i=l 

We  will  now  show  that 

N 

Tc{jHi.  (8.2.2) 
i=l 

for  some  integer  N.  If  this  is  not  so,  there  is  an  infinite  sequence  {tn}%Li  in  T  such  that 

n 

t„i{jHi,    n>\.  (8.2.3) 
i=i 

From  our  assumption,  {fn}^!  has  a  limit  7  in  7".  From  (8.2.1),  7  e  /f/t  for  some  fc,  so 
A/e(7)  C  Hfe  for  some  €  >  0.  Since  lim„^oo  fn  =  7,  there  is  an  integer  N  such  that 

n 

t„  e  N€(t)  C  Ffc  C  (J  ff; ,    b  > 
i=i 

which  contradicts  (8.2.3).  This  verifies  (8.2.2),  so  T  is  compact.  TJ 

Any  finite  subset  of  a  metric  space  obviously  has  the  Heine-Borel  property  and  is  there- 
fore compact.  Since  Theorem  8.2.3  does  not  deal  with  finite  sets,  it  is  often  more  convenient 
to  work  with  the  following  criterion  for  compactness,  which  is  also  applicable  to  finite  sets. 

Theorem  8.2.4  A  subset  T  of  a  metric  A  is  compact  if  and  only  if  every  infinite  se- 
quence {tn}  of  members  ofT  has  a  subsequence  that  converges  to  a  member  ofT. 

Proof  Suppose  that  T  is  compact  and  {tn}  C  T .  If  {/„}  has  only  finitely  many  distinct 
terms,  there  is  a  7  in  T  such  that  t„  =  7  for  infinitely  many  values  of  n;  if  this  is  so  for 
n\  <  «2  <  ■  ■■ ,  then  lim/^oo  tnj  =  7.  If  {tn}  has  infinitely  many  distinct  terms,  then  {tn} 
has  a  limit  point  7  in  T,  so  there  are  integers  n\  <  n%  <  ■■■  such  that  p(t„  .  ,7)  <  l/j ; 
therefore,  limy^oo  tnj  =  7. 

Conversely,  suppose  that  every  sequence  in  T  has  a  subsequence  that  converges  to  a  limit 
in  T ,  If  S  is  an  infinite  subset  of  T ,  we  can  choose  a  sequence  {tn}  of  distinct  points  in 
5.  By  assumption,  {tn }  has  a  subsequence  that  converges  to  a  member  7  of  T.  Since  7  is  a 
limit  point  of  {tn},  and  therefore  of  T,  T  is  compact.  TJ 

Theorem  8.2.5  IfT  is  compact,  then  every  Cauchy  sequence  {tn  }^Li  'n  T  converges 
to  a  limit  in  T. 
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Proof   By  Theorem  8.2.4,  {tn}  has  a  subsequence  {tnj  }  such  that 

lim  tn    =  7  eT.  (8.2.4) 

j-*oo 

We  will  show  that  linin^oo  tn  =  7. 

Suppose  that  e  >  0.  Since  {tn}  is  a  Cauchy  sequence,  there  is  an  integer  N  such  that 
p(tn,  tm)  <  e,  n  >  m  >  N.  From  (8.2.4),  there  is  an  m  =  rij  >  N  such  that  p(tm,T)  <  e. 
Therefore, 

p(t»J)  <p(tn,tm)  + p(tm,T)  <2e,    n>m.  H 


Theorem  8.2.6  IfT  is  compact,  then  T  is  closed  and  bounded. 

Proof  Suppose  that  7  is  a  limit  point  of  T .  For  each  n,  choose  tn  ^  7  e  B\/n(t)  n  7". 
Then  lim„^oo  tn  =  7.  Since  every  subsequence  of  {/„}  also  converges  to  7,  7  e  7",  by 
Theorem  8.2.3.  Therefore,  T  is  closed. 

The  family  of  unit  open  balls  M  =  {B\ (?)  \t  e  T}  is  an  open  covering  of  7".  Since  7"  is 
compact,  there  are  finitely  many  members  t\,  ti,  tn  of  T  such  that  S  C  U"=1  B\(t/). 
If  m  and  i)  are  arbitrary  members  of  T,  then  u  e  B\{tr)  and  u  e  B\{ts)  for  some  r  and  5  in 
{1,2,..., «},  so 

p(w,  u)  <  p(u,tr)  +  p(trJs)  +  p(ts,  v) 

<  2  +  p{tr,  ts)  <  2  +  max  {p{tt,tj)  \  1  <  i  <  j  <  "}  • 

Therefore,  7"  is  bounded.  H 

The  converse  of  Theorem  8.2.6  is  false;  for  example,  if  A  is  any  infinite  set  equipped 
with  the  discrete  metric  (Example  8.1.2.),  then  every  subset  of  A  is  bounded  and  closed. 
However,  if  T  is  an  infinite  subset  of  A,  then  3t  =  {{?}  1 1  e  7"}  is  an  open  covering  of  T, 
but  no  finite  subfamily  of  3i  covers  T . 

Definition  8.2.7  A  set  T  is  totally  bounded  if  for  every  e  >  0  there  is  a  finite  set  T€ 
with  the  following  property:  if  t  e  T,  there  is  an  s  €  T€  such  that  p(s,  t)  <  e.  We  say  that 
Te  is  a  finite  e-netfor  T .  ■ 

We  leave  it  to  you  (Exercise  8.2.4)  to  show  that  every  totally  bounded  set  is  bounded  and 
that  the  converse  is  false. 

Theorem  8.2.8  IfTis  compact,  then  T  is  totally  bounded. 

Proof  We  will  prove  that  if  T  is  not  totally  bounded,  then  T  is  not  compact.  If  T  is  not 
totally  bounded,  there  is  an  e  >  0  such  that  there  is  no  finite  €-net  for  T .  Let  t\  e  T .  Then 
there  must  be  a  t2  in  T  such  that  p(t\,  ti)  >  e.  (If  not,  the  singleton  set  {ti}  would  be  a 
finite  e-net  for  7".)  Now  suppose  that  n  >  2  and  we  have  chosen  t\,  t^,  tn  such  that 
p(U,  tj)  >  6,  1  <  i  <  j  <  7i.  Then  there  must  be  a  tn+\  e  T  such  that  p(ti,  f«+i)  >  e, 
1  <  i  <  n.  (If  not,  {t\,  tz, . . . ,  tn}  would  be  a  finite  e-net  for  T .)  Therefore,  p(ti,  tj)  >  e, 
\<i<j<n  +  \.  Hence,  by  induction,  there  is  an  infinite  sequence  {tn}^=1  in  T  such 
that  p(tj  ,tj)  >  e,  i  7^  y .  Since  such  a  sequence  has  no  limit  point,  T  is  not  compact,  by 
Theorem  8.2.4.  TJ 
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Theorem  8.2.9  If  (A,  p)  is  complete  and  T  is  closed  and  totally  bounded,  then  T  is 
compact. 

Proof  Let  S  be  an  infinite  subset  of  T,  and  let  {si  }f?_1  be  a  sequence  of  distinct  members 
of  S.  We  will  show  that  {i,  has  a  convergent  subsequence.  Since  T  is  closed,  the  limit 
of  this  subsequence  is  in  T,  which  implies  that  T  is  compact,  by  Theorem  8.2.4. 

For  n  >  1,  let  T\jn  be  a  finite  l/«-net  for  T.  Let  {st^fL^  =  fai}^!-  Since  T\  is 
finite  and  {5/o}^i  is  infinite,  there  must  be  a  member  t\  of  T\  such  that  pisio,  t\)  <  1 
for  infinitely  many  values  of  i.  Let  {snYfL-,  be  the  subsequence  of  {siqY^L,  such  that 
p(sn,h)  <  1. 

We  continue  by  induction.  Suppose  that  n  >  1  and  we  have  chosen  an  infinite  subse- 
quence {si,n-\}fL\  °f  {si,n-2]fL\-  Since  T\/n  is  finite  and  {si,n-\}fL\  is  infinite,  there 
must  be  member  tn  of  T\jn  such  that  p(si,„-i,  t„)  <  l/n  for  infinitely  many  values  of 
i.  Let  {sin}'^=i  be  the  subsequence  of  {si^-^^i  such  that  p(Si„,  tn)  <  l/n.  From  the 
triangle  inequality, 

p(si„,sjn)  <  2/n,    i,j  >  1,    n  >  1.  (8.2.5) 

Now  let  s";  =  s / j ,  i  >  1.  Then  is  an  infinite  sequence  of  members  of  7".  Mo- 

roever,  if  i,j  >  /i,  then'?,-  andT/  are  both  included  in  {sin}f2.1,  so  (8.2.5)  implies  that 
p(si,Sj)  <  2/«;  that  is,  {5;}^!  is  a  Cauchy  sequence  and  therefore  has  a  limit,  since 
(A,  p)  is  complete.  H 

Example  8.2.1  Let  T  be  the  subset  of  such  that  |x,  |  <  >  1,  where  lim/^oo /z,  = 
0.  Show  that  7"  is  compact. 

Solution  We  will  show  that  T  is  totally  bounded  in  -£oo-  Since  is  complete  (Exer- 
cise 8.1.26),  Theorem  8.2.9  will  then  imply  that  T  is  compact. 

Let  e  >  0.  Choose  N  so  that  /i,-  <  e  if  (  >  N .  Let  pi  =  max  {/i,-  1  <  i  <  «}  and  let  p 
be  an  integer  such  that  pe  >  p,.  Let  Q€  =  {r,e  |  r;-  =  integer  in[—  p,  />]}.  Then  the  subset 
of  ioo  such  that  x;-  e  2e>  1  <  i  <  N,  and     =  0,  i  >  N,  is  a  finite  e-net  for  T. 

Compact  Subsets  of  C[a,  b] 

In  Example  8.1.7  we  showed  that  C[a,b]  is  a  complete  metric  space  under  the  metric 

P(Lg)  =  =  max {!/(•*) -#(•*) I  |  a  <  x  <  b) . 

We  will  now  give  necessary  and  sufficient  conditions  for  a  subset  of  C  [a,  b]  to  be  compact. 

Definition  8.2.10  A  subset  T  of  C  [a,  b]  is  uniformly  bounded  if  there  is  a  constant  M 
such  that 

|/(x)|  <  M    if    a  <  jc  <  b    and     /el  (8.2.6) 
A  subset  T  of  C  [a,  b]  is  equicontinuous  if  for  each  e  >  0  there  is  a  5  >  0  such  that 

\f(xi)- f(x2)\  <e    if    xi,x2  e  [a,b],     |xi-x2|<<5,    and    f  €  T.  (8.2.7) 
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Theorem  2.2.8  implies  that  for  each  /  in  C  [a,  b]  there  is  a  constant  Mf  which  depends 
on  f,  such  that 

1/0)1  <  Mf    if    a  <  x  <b, 
and  Theorem  2.2. 12  implies  that  there  is  a  constant  S /  which  depends  on  f  and  e  such  that 

\f{x\)  —  f{x2)\<e    if    X\,x2e[a,b]    and     |xi  -  x2|  <  <5/. 

The  difference  in  Definition  8.2.1 1  is  that  the  same  M  and  S  apply  to  all  f  in  T. 

Theorem  8.2.11  A  nonempty  subset  T  ofC[a,b]  is  compact  if  and  only  if  it  is  closed, 
uniformly  bounded,  and  equicontinuous. 

Proof   For  necessity,  suppose  that  T  is  compact.  Then  T  is  closed  (Theorem  8.2.6) 
and  totally  bounded  (Theorem  8.2.8).  Therefore,  if  e  >  0,  there  is  a  finite  subset  T€  = 
{gi<  g2,  ■  ■  ■ ,  ft}  °f  C[a,  b]  such  that  if  /  e  T,  then  ||  f—gj  ||  <  €  for  some  i  in{l,  2, . . . ,  k}. 
If  we  temporarily  let  e  =  1,  this  implies  that 

11/11  =  IK/ -gd  +  gi\\  <  11/ -ft II  +  lift II  <  i  +  lift  II, 

which  implies  (8.2.6)  with 

M  =  1  +  max  {\\gi  ||  1 1  <  i  <  k)  . 
For  (8.2.7),  we  again  let  6  be  arbitary,  and  write 

|/(xi)  -  f(x2)\  <  |/O0  -  gi(Xl)\  +  \gi(xi)  -  gi(x2)\  +  \gi(x2)  -  f(x2)\ 

<\gi(xi)-gi(x2)\+  2||/ -gi\\  (8.2.8) 
<  \gi(xi)  -  gi(x2)\  +  2e. 

Since  each  of  the  finitely  many  functions  g\,  g2,  . . . ,  gk  is  uniformly  continuous  on  [a,  b] 
(Theorem  2.2.12),  there  is  a  S  >  0  such  that 

\gi(xi)-gi(x2)\  <e    if    \xi-x2\<&,  l<i<k. 

This  and  (8.2.8)  imply  (8.2.7)  with  e  replaced  by  3e.  Since  this  replacement  is  of  no 
consequence,  this  proves  necessity. 

For  sufficiency,  we  will  show  that  T  is  totally  bounded.  Since  T  is  closed  by  assumption 
and  C[a,  b]  is  complete,  Theorem  8.2.9  will  then  imply  that  T  is  compact. 

Let  m  and  n  be  positive  integers  and  let 

r  sM 

t-r  =  a  H  (b  —  a),    0  <  r  <  m,    and     r]s  =  ,  —n<s<n; 

m  n 

that  is,  a  =  £o  <  £i  <  •  ■  •  <  £m  =  b  is  a  partition  of  [a,  b]  into  subintervals  of  length 
(b  —  a)/m,  and  —  M  =         <  <  ■•■  <  <  /]„  =  M  is  a  partition  of  the 
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segment  of  the  j-axis  between  y  =  — M  and  y  =  M  into  subsegments  of  length  M/n. 
Let  Smn  be  the  subset  of  C[a,  b]  consisting  of  functions  g  such  that 

{#(£<>),  g(%l),  •  •  •  ,  g(£m)}  C  {r]-n,  7]-„  +  i  7]„-i,  T}„} 

and  g  is  linear  on  \  <  i  <  m.  Since  there  are  only  (m  +  1)(2«  +  1)  points  of  the 

form  (£r,  T]s),  Smn  is  a  finite  subset  of  C[a,  b]. 

Now  suppose  that  e  >  0,  and  choose  8  >  0  to  satisfy  (8.2.7).  Choose  m  and  «  so  that 
(b  -  a)/m  <  8  and  2 M/n  <  e.  If  /  is  an  arbitrary  member  of  T ,  there  is  a  g  in  Smw  such 
that 

- /(&)l  <  e.    0</  <m.  (8.2.9) 

If  0  <  ('  <  m  -  1, 

IfdO-^fe+OI  =  lf(ft)-/(li)l  +  l/&)-/(&+i)H-|/(ft+i)-f(ft+i)|.  (8-2.10) 
Since        -  &  <  8,  (8.2.7),  (8.2.9),  and  (8.2.10)  imply  that 

lf(&)-*(&+i)l<3e. 

Therefore, 

<3e,  (8.2.11) 

since  g  is  linear  on  [^-, 

Now  let  x  be  an  arbitrary  point  in  [a,b],  and  choose  i  so  that  x  e  [fj,  Then 

|/W-g(x)|  <  |/(X)-  +  ~g&)\  +  \gfo)-g(.*)\, 

so  (8.2.7),  (8.2.9),  and  (8.2.11)  imply  that  \f(x)-g(x)\  <5e,a<x<  b.  Therefore,  Sm„ 
is  a  finite  5e-net  for  T ,  so  T  is  totally  bounded.  H 

Theorem  8.2.12  (Ascoli  Arzela  Theorem)  Suppose  that  !F  is  an  infinite  uni- 
formly bounded  and  equicontinuous  family  of  functions  on  [a,b].  Then  there  is  a  sequence 
{fn}  in      that  converges  uniformly  to  a  continuous  function  on  [a,  b]. 

Proof  Let  T  be  the  closure  of  !F;  that  is,  /  6  T  if  and  only  if  either  /  e  T  or  / 
is  the  uniform  limit  of  a  sequence  of  members  of  3< .  Then  T  is  also  uniformly  bounded 
and  equicontinuous  (verify),  and  T  is  closed.  Hence,  T  is  compact,  by  Theorem  8.2.12. 
Therefore,  "F  has  a  limit  point  in  T .  (In  this  context,  the  limit  point  is  a  function  /  in 
T .)  Since  /  is  a  limit  point  of  3> ,  there  is  for  each  integer  n  a  function  /„  in  S>  such  that 
II  fn  —  f  II  <  l/w;  that  is  {/„ }  converges  uniformly  to  /  on  [a,  b].  H 

8.2  Exercises 


1.     Suppose  that  T\,  7~2,  . . . ,  7~jt  are  compact  sets  in  a  metric  space  (A,  p).  Show  that 
Uj=lTj  is  compact. 
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2.  (a)    Show  that  a  closed  subset  of  a  compact  set  is  compact. 

(b)  Suppose  that  T  is  any  collection  of  closed  subsets  of  a  metric  space  (^4,  p), 
and  some  T  in  T  is  compact.  Show  that  fl  {7"  |  T  e  7"}  is  compact. 

(c)  Show  that  if  T  is  a  collection  of  compact  subsets  of  a  metric  space  (^4,  p), 
thenn  {T  \  T  e  T}  is  compact. 

3.  If  S  and  T  are  nonempty  subsets  of  a  metric  space  (^4,  p),  we  define  the  distance 
from  S  to  T  by 

dist(S,r)  =M{p(s,t)\s  €  S,  t  e  T) . 

Show  that  if  S  and  T  are  compact,  then  dist(S,  7")  =  p(s,  t)  for  some  s  in  S  and 
some  f  in  7\ 

4.  (a)    Show  that  every  totally  bounded  set  is  bounded, 
(b)  Let 


and  let  T  be  the  subset  of  consisting  of  the  sequences  Xr  =  {&ir}TLv 
r  >  1 .  Show  that  T  is  bounded,  but  not  totally  bounded. 

5.  Let  T  be  a  compact  subset  of  a  metric  space  (^4,  p).  Show  that  there  are  members  1 
and  7  of  T  such  that  <i(5, 7)  =  d(T). 

6.  Let  r  be  the  subset  of  l\  such  that  |x;  |  <  /X;,  i  >  1,  where  Y^h=\  fa  <  00 •  Show 
that  T  is  compact. 

7.  Let  T  be  the  subset  of  I2  such  that  [jtj  |  <  fii,  i  >  1,  where  ^.^  <  00.  Show 
that  T  is  compact. 

8.  Let  S  be  a  nonempty  subset  of  a  metric  space  (A,  p)  and  let  wo  be  an  arbitrary 
member  of  A.  Show  that  5  is  bounded  if  and  only  if  D  =  {p(u,  uq)  |  u  e  S}  is 
bounded. 

9.  Let  (j4,  p)  be  a  metric  space. 

(a)  Prove:  If  S  is  a  bounded  subset  of  ^4,  then  5  (closure  of  5)  is  bounded.  Find 
d(S). 

(b)  Prove:  If  every  bounded  closed  subset  of  A  is  compact,  then  (^4,  p)  is  com- 
plete. 

10.  Let  (A,  p)  be  the  metric  space  defined  in  Exercise  8.1.16  Let 

T  =  TlxT2x---xTk, 

where  Tt  C  At  and  7}  ^  0,  1  <  i  <  £.  Show  that  T  is  compact  if  and  only  7j  is 
compact  for  1  <  /  <  fc. 

11.  Let  (^4,  p)  be  the  metric  space  defined  in  Exercise  8.1.17.  Let 

T  =  7\  x  T2  x  •  •  •  x  T„  x  •  •  •  , 
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where  7}  C  A\  and  T,  ^  0,  i  >  1.  Show  that  if  T  is  compact,  then  is  compact 
for  all  i  >  1. 

12.  Let  {T/j}^!  be  a  sequence  of  nonempty  closed  sets  of  a  metric  space  such  that  (a) 
T\  is  compact;  (b)  Tn+\  C  Tn,  n  >  1;  and  (c)  lim„^oo  d{T„)  =  0.  Show  that 
(~1^=1 7"„  contains  exactly  one  member. 

8.3  CONTINUOUS  FUNCTIONS  ON  METRIC  SPACES 

In  Chapter  we  studied  real-valued  functions  defined  on  subsets  of  W,  and  in  Chapter  6.4. 
we  studied  functions  defined  on  subsets  of  R"  with  values  in  Km .  These  are  examples  of 
functions  defined  on  one  metric  space  with  values  in  another  metric  space. (Of  course,  the 
two  spaces  are  the  same  if  n  =  m.) 

In  this  section  we  briefly  consider  functions  defined  on  subsets  of  a  metric  space  {A,  p) 
with  values  in  a  metric  space  (B,  a).  We  indicate  that  /  is  such  a  function  by  writing 

f:(A,p)^(B,a). 

The  domain  and  range  of  /  are  the  sets 

D  f  =  {u  e  A  |  f(u)  is  defined} 

and 

Rf  =  {v&B\v  =  f  (u)  for  some  u  in  Z)/}  . 
Definition  8.3.1  We  say  that 

lim  / (u)  = 

ifu  e  D  f  and  for  each  e  >  0  there  is  a  S  >  0  such  that 

o{f(u),v)  <  e    if    u&Df    and     0  <  p(u,u)  <  8.  (8.3.1) 

Definition  8.3.2  We  say  that  /  is  continuous  at  u  ifu  e  D  r  and  for  each  e  >  0  there 
is  a  S  >  0  such  that 

o-(/(w),  /("))  <  e    if    "  e  Df  nNs(u).  (8.3.2) 
If  /  is  continuous  at  every  point  of  a  set  5,  then  /  is  continuous  on  S.  ■ 

Note  that  (8.3.2)  can  be  written  as 

f(DfnNs(u))cNe(f(u)). 
Also,  /  is  automatically  continuous  at  every  isolated  point  of  D  f .  (Why?) 
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Example  8.3.1  If  (^4.  ||  ■  ||)  is  a  normed  vector  space,  then  Theorem  8.3.5  implies  that 
f(u)  =  \\u\\  is  a  continuous  function  from  (^4,  p)  to  R,  since 

WWW  —  \\u\\  I  —  \\u  —  u\\. 
Here  we  are  applying  Definition  8.3.2  with  p(u,  u)  =  \\u  —  u\\  anda(i>,iT)  =  [ u  —  v"[ . 

Theorem  8.3.3  Suppose  that u  e  D  f.  Then 

lim  f(u)  =  v  (8.3.3) 

u — >u 

if  and  only  if 

lim  f(u„)  =  v  (8.3.4) 

n— *oo 

for  every  sequence  {u„}  in  D f  such  that 

lim  un  =  u.  (8.3.5) 

n— >oo 

Proof  Suppose  that  (8.3.3)  is  true,  and  let  {«„}  be  a  sequence  in  Df  that  satisfies 
(8.3.5).  Let  e  >  0  and  choose  S  >  0  to  satisfy  (8.3.1).  From  (8.3.5),  there  is  an  inte- 
ger N  such  that  p(u„,u)  <  S  if  n  >  N.  Therefore,  <7(/(w„),  iT)  <  e  if  n  >  N,  which 
implies  (8.3.4). 

For  the  converse,  suppose  that  (8.3.3)  is  false.  Then  there  is  an  €q  >  0  and  a  sequence 
{«„}  in  D  f  such  that  p{un,u)  <  l/n  and  o(f  (m„),1T)  >  €o,  so  (8.3.4)  is  false.  H 

We  leave  the  proof  of  the  next  two  theorems  to  you. 
Theorem  8.3.4  A  function  f  is  continuous  at  u  if  and  only  if 

lirn  f(u)  =  f(u). 

u—>u 

Theorem  8.3.5  A  function  f  is  continuous  at  u  if  and  only  if 

lim  f(un)  =  f(u) 

n — ^oo 

whenever  {un}  is  a  sequence  in  D  f  that  converges  to  u. 

Theorem  8.3.6  If  f  is  continuous  on  a  compact  set  T,  then  f  (T)  is  compact. 

Proof  Let  {vn}  be  an  infinite  sequence  in  f(T).  For  each  «,  t;„  =  f(u„)  for  some  un  e 
T .  Since  T  is  compact,  {u„}  has  a  subsequence  {unj}  such  that  limy^oo  unj  =  u  e  T 
(Theorem  8.2.4).  From  Theorem  8.3.5,  limy^oo  f(unj  )  =  f(u);  that  is,  lim/_>00  vnj  = 
f  (u).  Therefore,  / (T)  is  compact,  again  by  Theorem  8.2.4.  H 

Definition  8.3.7  A  function  /  is  uniformly  continuous  on  a  subset  S  of  D  f  if  for  each 
e  >  0  there  is  a  S  >  0  such  that 


a(  f(u),  f(v))  <  e    whenever    p(u,  v)  <  S    and     u,  v  e  S. 
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Theorem  8.3.8  Iff  is  continuous  on  a  compact  set  T,  then  f  is  uniformly  continuous 
on  T. 

Proof  If  /  is  not  uniformly  continuous  on  T,  then  for  some  Co  >  0  there  are  sequences 
{un}  and  {vn}  in  T  such  that  p(un,  v„)  <  l/n  and 

o(f(un),f(vn))>e0.  (8.3.6) 

Since  T  is  compact,  {un }  has  a  subsequence  {u„k }  that  converges  to  a  limit  u  in  T  (Theo- 
rem 8.2.4).  Since  p(unfc,  v„k)  <  l/«/t,  lim/^oo  v„k  =  u  also.  Then 

lim  f(u„k)  =  lim  f(v„k)  =  f(u) 

k—roo  K — >oo 

(Theorem  8.3.5),  which  contradicts  (8.3.6).  H 

Definition  8.3.9  If  /  :  (A,  p)  (A,  p)  is  defined  on  all  of  A  and  there  is  a  constant  a 
in  (0,  1)  such  that 

p(f(u),  /(«))  <  ap(u,  v)    for  all    (u,  v)  e  A  x  A,  (8.3.7) 
then  /  is  a  contraction  of  (A,  p).  ■ 

We  note  that  a  contraction  of  (^4,  p)  is  uniformly  continuous  on  A. 

Theorem  8.3.10  (Contraction  Mapping  Theorem)  Iff  is  a  contraction 
of  a  complete  metric  space  {A,  p),  then  the  equation 

f(u)  =  u  (8.3.8) 

has  a  unique  solution. 

Proof  To  see  that  (8.3.8)  cannot  have  more  than  one  solution,  suppose  that  u  =  f(u) 
and  v  =  f(v).  Then 

p(u,v)  =  p(f(u),f(v)).  (8.3.9) 

However,  (8.3.7)  implies  that 

p(f(u),f(v))<ap(u,v).  (8.3.10) 
Since  (8.3.9)  and  (8.3.10)  imply  that 

p(w,  v)  <  ap(u,  v) 

and  a  <  1,  it  follows  that  p(u,  v)  =  0.  Hence  u  =  v. 

We  will  now  show  that  (8.3.8)  has  a  solution.  With  uq  arbitrary,  define 

w„  =  /(u„_i),    n>\.  (8.3.11) 

We  will  show  that  {un}  converges.  From  (8.3.7)  and  (8.3.1 1), 

p(u„+i,u„)  =  p(f(u„),  f(u„-\))  <  ap(u„,u„-i).  (8.3.12) 
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The  inequality 

p(un+i ,  u„)  <  anp(u\,  u0),    n  >  0,  (8.3.13) 

follows  by  induction  from  (8.3. 12).  If«  >  m,  repeated  application  of  the  triangle  inequality 
yields 

p(u„,um)  <  p(u„,u„-i)  +  p(u„-i,u„-2)  H  h  p(um+i,um), 

and  (8.3.13)  yields 


p{u„,um)  <  p(ui,u0)am(l  +a-\  ha"  m  x)  < 


or 


1  -a 
Now  it  follows  that 

p(ui,u0)  N 

p(un,um)  <   a      if  n,m>N, 

I  —  a 

and,  since  lini/v^no  aN  =  0,  {m„}  is  a  Cauchy  sequence.  Since  ^4  is  complete,  {u„}  has  a 
limit  it.  Since  /  is  continuous  at  u, 

f(u)  =  lim  f(un-\)  =  lim  w„  =  u, 

n— too  n— >oo 

where  Theorem  8.3.5  implies  the  first  equality  and  (8.3.1 1)  implies  the  second.  H 

Example  8.3.2  Suppose  that  h  =  h(x)  is  continuous  on  [a,  b],  K  =  K(x,  y)  is  con- 
tinuous on  [a,b]  x  [a,b],  and  <  M  if  a  <  x,  y  <  b.  Show  that  if  |A|  < 
1  / M(b  —  a)  there  is  a  unique  it  in  C [a,  b]  such  that 

u(x)  =  h(x)  +  X  f  K(x,y)u(y)dy,     a  <  x  <  b.  (8.3.14) 

J  a 

(This  is  Fredholm's  integral  equation.) 

Solution  Let  A  be  C[a,  b],  which  is  complete.  If  u  e  C[a,  b],  let  f(u)  =  v,  where 

fb 

v(x)  =  h(x)  +  X  I    K(x,  y)u(y)  dy,     a  <  x  <  b. 

J  a 

Since  v  e  C[a,  b],  f  :  C[a,b]  — >  C[a,  b].  If  u\,  u-i  e  C[a,  b],  then 

\v1(x)-v2(x)\  <  \X\  f  \K(x,y)\\ul(y)-vl(y)\dy, 

J  a 

SO 

\\vi  -  v2\\  <  \X\M(b -a)\\ui  -  u2\\. 

Since  |A|M(i  —  a)  <  1,  /  is  a  contraction.  Hence,  there  is  a  unique  u  in  C[a,  b]  such  that 
f(u)  =  u.  This  u  satisfies  (8.3.14). 
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1.  Suppose  that  /  :  (A,  p)  (B,o)  and  Df  =  A.  Show  that  the  following  state- 
ments are  equivalent. 

(a)  /  is  continuous  on  A. 

(b)  If  V  is  any  open  set  in  (B,  cr),  then  f~l  (V)  is  open  in  {A,  p). 

(c)  If  V  is  any  closed  set  in  (B,  a),  then  f~l(V)  is  closed  in  (A,  p). 

2.  A  metric  space  (A,  p)  is  connected  if  A  cannot  be  written  as  A  =  A\  U  A2,  where 
A\  and  A2  are  nonempty  disjoint  open  sets.  Suppose  that  (A,  p)  is  connected  and 
/  :  {A,  p)  — >  (B,  a),  where  D/  =  A,  R/  =  B,  and  /  is  continuous  on  A.  Show 
that  (B,  a)  is  connected. 

3.  Let  /  be  a  continuous  real-valued  function  on  a  compact  subset  5  of  a  metric  space 
(A,  p).  Let  a  be  the  usual  metric  on  R;  that  is,  a(x,  y)  =  \x  —  y\. 

(a)  Show  that  /  is  bounded  on  5. 

(b)  Let  a  =  infM6s  f(u)  and  p1  =  supueS  f(u).  Show  that  there  are  points  u\ 
and  u2  in  [a,  b]  such  that  f{u\)  =  a  and  f(u2)  =  fi- 

4.  Let  /  :  {A,  p)  — >  (B,  a)  be  continuous  on  a  subset  U  of  A.  Let  u  be  in  U  and 
define  the  real- valued  function  g  :  (A,  p)  — >  R  by 

g(u)  =  o(f(u),  f(u)),  U€U. 

(a)  Show  that  g  is  continuous  on  U. 

(b)  Show  that  if  U  is  compact,  then  g  is  uniformly  continuous  on  U . 

(c)  Show  that  if  U  is  compact,  then  there  is  a  u  e  U  such  that  g(u)  <  g(u), 
u  €  U . 

5.  Suppose  that  (A,  p),  (B,  ct),  and  (C,  y)  are  metric  spaces,  and  let 

f  :{A,p)^{B,a)    and    g  :  (B,  o)  ->  (C,  y), 

where  D f  =  A,  Rf  =  Dg  =  B,  and  /  and  g  are  continuous.  Define  :  (/4,  p)  — > 
(C,  y)  by        =  g(/ («)).  Show  that  h  is  continuous  on  A. 

6.  Let  (A,  p)  be  the  set  of  all  bounded  real- valued  functions  on  a  nonempty  set  S, 
with  p(u,  v)  =  supieS  \u(s)  —  v(s)\.  Let  si,  s2,  sjfc  be  members  of  S,  and 
f(u)  =  g(u(si),  u(s2),  ■  ■  ■ ,  u(sk)),  where  g  is  real-valued  and  continuous  on  Rk . 
Show  that  /  is  a  continuous  function  from  (A,  p)  to  R. 

7.  Let  (^4,  p)  be  the  set  of  all  bounded  real- valued  functions  on  a  nonempty  set  S, 
with  p(u,  v)  =  supJgS  \u(s)  —  v(s)\.  Show  that  f{u)  =  infj€s  u(s)  and  g(u)  = 
suPseS  u(s)  aie  uniformly  continuous  functions  from  (A,  p)  to  R. 

8.  Let  I[a,b]  be  the  set  of  all  real-valued  functions  that  are  Riemann  integrable  on 

fb 

[a,  b],  with  p(u,  v)  =  supa<x<fc  \u(x)  —  v(x)\.  Show  that  f(u)  =  I  u(x)dx'isa 

~  ~  J  a 

uniformly  continuous  function  from  I[a,b]  to  R. 


548    Answers  to  Selected  Exercises 


Answers  to  Selected 
Exercises 


Section  1.1    pp.  9-10 

1.1.1  (p.  9)  (a)  2max(a,b)    (b)2min(a,6)    (c)  4max(<a,  b,  c)    (d)  4min(<a,  b,  c) 

1.1.5  (p.  9)  (a)  oo  (no);  -1  (yes)    (b)  3  (no);  -3  (no)    (c)  V7  (yes);  -yfi  (yes) 
(d)  2  (no);  -3  (no)     (e)  1  (no);  -1  (no)    (f)  VI  (no);  -«Jl  (no) 

Section  1.2    pp.  15-19 

1.2.9  (p.  16)  (a)  2"/{2n)\     (b)  2- 3"/(2«  +  1)!     (c)  2""(2«)!/(«!)2     (d)  n"/nl 

1.2.10  (p.  16)  (b)  no    1.2.11  (p.  16)  (b)  no 


1.2.21  (p.  18)  fn(xi,x2,  .  ..,x„)  =  2"  1max(x1,x2, ...,x„),  g„(xi,x2, ...,x„)  = 
2""1  min(xi,  x2, . . . ,  xn) 

Section  1.3    pp.  27-29 

1.3.1  (p.  27)  (a)  [I  l);(-oo,  ±)U[l,oo);  (-oo,0]U(§,oo);(0,  §];(-oo,0]U(|,oo); 
(-oo,  |]  U  [1,  oo)  (b)  (-3,  -2)  U  (2,  3);  (-oo,  -3]  U  [-2,  2]  U  [3,  oo);  0;  (-oo,  oo);  0; 
(-oo,  -3]  U  [-2,  2]  U  [3,  oo)  (c)  0;  (-oo,  oo);  0;  (-oo,  oo);  0;  (-oo,  oo) 

(d)  0;  (-oo,  oo);  [-1,  1];  (-oo,  -1)  U  (1,  oo);  [-1,  1];  (-oo,  oo) 

1.3.2  (p.  27)  (a)  (0,  3]    (b)  [0, 2]    (c)  (-oo,  1)  U  (2,  oo)     (d)  (-oo,  0]  U  (3,  oo) 


1.3.4  (p.  27)  (a)  i     (b)  ±     (c)  6    (d)  1 
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1.3.5  (p.  27)  (a)  neither;  (-1,2)  U  (3,oo);  (-oo,-l)  U  (2,3);  (-oo,-l]  U  (2,3); 
(-oo,-l]  U  [2,  3]  (b)  open;  S;  (1,2);  [1,2]  (c)  closed;  (-3,  -2)  U  (7,  8);  (-oo,  -3)  U 
(-2,7)U(8,oo);  (-oo-3]  U[-2,  7]  U[8,  oo)  (d)  closed;  0;  (J  {(n,  n  +  1)  |  n  =  integer}; 
(— oo, oo) 

1.3.20  (p.  28)  (a)  {x  \  x  =  l/n,  n  =  1,2, . . .};  (b)  0  (c),  (d)  Si  =  rationals, 
S2  =  irrationals  (e)  any  set  whose  supremum  is  an  isolated  point  of  the  set  (f ),  (g)  the 
rationals  (h)  Si  =  rationals,  S2  =  irrationals 


Section  2.1    pp.  48-53 

2.1.2  (p.  48)  D f  =  [-2, 1)  U  [3, 00),  Dg  =  (-00,  -3]  U  [3, 7)  U  (7, 00),  D f±g  = 
Dfg  =  [3,  7)  U  (7,  00),  D f/g  =  (3,  4)  U  (4,  7)  U  (7,  00) 

2.1.3  (p.  48)  (a),  (b)  {x\x  ^  (2k  +  \)n/2  where  k  =  integer}  (c)  {x  \  x  ^  0, 1} 
(d)  {x\x^Q}    (e)  [l,oo) 

2.1.4  (p.  49)  (a)  4    (b)  12    (c)  -1     (d)  2     (e)  -2 

2.1.6  (p.  49)  (a)  ii    (b)-§     (c)i     (d)  2 

2.1.7  (p.  49)  (a)  0, 2    (b)  0,  none     (c)  -±,  i     (d)  none,  0 
2.1.15  (p.  50)  (a)  0    (b)  0    (c)  none     (d)  0     (e)  none  (f )  0 
2.1.18  (p.  50)  (a)  0     (b)  0    (c)  none     (d)  none     (e)  none  (f )  0 
2.1.20  (p.  50)  (a)  00     (b)  -00     (c)  00     (d)  00     (e)  00  (f)  -00 
2.1.22  (p.  51)  (a)  none    (b)  00    (c)  00    (d)  none 

2.1.24  (p.  51)  (a)  00    (b)  00     (c)  00     (d)  -00     (e)  none    (f)  00 

2.1.31  (p.  52)  (a)  §     (b)  §    (c)  00     (d)  -00     (e)  00    (f)  \ 

2.1.32  (p.  52)  linij^oo  r(x)  =  00  if  n  >  m  and  an/bm  >  0;  =  —00  if  n  >  m  and 
an/bm  <  0;  =  an/bm  if  n  =  m;  =  Oif  n  <  m.  lim;(:_>_00  r(x)  =  (-l)"~m  Hindoo  r(x) 

2.1.33  (p.  52)  limx^xo  f(x)  =  limx^xo  g{x) 

2.1.37  (p.  52)  (c)  m^^-if-g^x)  <  IS^0_  /(x)-limx^0_g(x);limx^;C0_(/- 
g)(x)  >  limx^xo_  f(x)  -  linw^-  g(x) 

Section  2.2    pp.  69-73 

2.2.3  (p.  69)  (a)  from  the  right     (b)  continuous     (c)  none     (d)  continuous  (e) 
none     (f )  continuous     (g)  from  the  left 

2.2.4  (p.  69)  [0,1),  (0,1),  [1,2),  (1,2),  (1,2],  [1,2]     2.2.5  (p.  69)  [0,1),  (0,1), 
(l,oo)2.2.13  (p.  70)  (b)  tanhx  is  continuous  for  all  x,  cothx  for  all  x  ^  0 
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2.2.16  (p.  70)  No    2.2.21  (p.  71)  (a)  [-1, 1],  [0,  oo)  (b)  \JT=-™(2n*-  0  + 

(0,  oo)  (c)  Ur=-oo(«^  («  +         (-oo,-l)  U  (-1, 1)  U  (1,  oo)  (d)  U~=-oo["^  ("  + 

£)w],[0,oo) 

2.2.23  (p.  71)  (a)  (-1, 1)    (b)  (-00,00)     (c)  x0  ^  (2fc  +  §7r),  it  =  integer  (d) 
1/  i       (e)  x  ^  1     (f)  jc  7^  (fc  +  jTr),     =  integer     (g)  ±jr), 
integer  (h)  x  ^  0     (i)  x  ^  0 


Section  2.3    pp.  84-88 

2.3.4  (p.  85)  (b)  p(c)  =  q(c)  and  p'_(c)  =  q'+(c) 

2.3.5  (p.  85)  f(k){x)  =  n(n-l)  ■  ■  ■  (n-k-l)x"-k-1  \x\if  1  <  it  <  /(b)(jc)  =  n\ 
if  x  >  0;  f{n)(x)  =  -n\  if  x  <  0;  /(fc)(jc)  =  0  if  >  n  and  x  7^  0;  /(fc)(0)  does  not 
exist  if  k  >  n. 

2.3.7  (p.  85)  (a)  c'  =  ac  -  bs,  s'  =  be  +  as  (b)  c(x)  =  eax  cosbx,  s(x)  = 
eax  sinbx 

2.3.15  (p.  86)  (b)  f(x)  =  -1  if  x  <  0,  f(x)  =  1  if  x  >  0;  then  /'(0+)  =  0,  but 
/+ (0)  does  not  exist,  (c)  continuous  from  the  right 
2.3.22  (p.  87)  There  is  no  such  function  (Theorem  2.3.9). 

2.3.24  (p.  87)  Counterexample:  Let  x0  =  0,  f(x)  =  |jc  |3/2  sin(l/jc)  if  x  ^  0,  and 
f(P)  =  0  . 

2.3.27  (p.  88)  Counterexample:  Let  x0  =  0,  f(x)  =  x/\x\  if  x  ^  0,  /(0)  =  0. 


Section  2.4    pp.  96-98 


2.4.2  (p.  96)  1 
2.4.6  (p.  96)  1 

2.4.10  (p.  96)  0 
2.4.14  (p.  96)  -i 
2.4.18  (p. 

2.4.24  (p. 

2.4.23  (p 

2.4.25  (p 

2.4.29  (p 

2.4.30  (p.  96)  00 
2.4.34  (p.  97) 


2.4.3  (p.  96)  i 
2.4.7  (p.  96)  0 


2.4.4  (p.  96)  00  2.4.5  (p.  96)  {-\)n~ln 
2.4.8  (p.  96)  1      2.4.9  (p.  96)  0 

2.4.11  (p.  96)  0    2.4.12  (p.  96)  -00    2.4.13  (p.  96)  0 


2.4.15  (p.  96)  0 
96)  1  ~  2.4.19  (p.  96)  1 
96)  \/e    2.4.22  (p.  96)  0 

96)  -00  if  a  <  0,  Oifa  >  0 

96)  e2    2.4.26  (p.  96)  1 

96)  00  if  a  >  0,  -00  if  a  <  0 

2.4.31  (p.  97)  1 

2.4.35  (p.  97)  - 


2.4.16  (p.  96)  0 
2.4.20  (p.  96)  e 


2.4.17  (p.  96)  1 
2.4.21  (p.  96)  1 


00 


2.4.27  (p.  96)  0       2.4.28  (p.  96)  0 

2.4.32  (p.  97)  1/120  2.4.33  (p.  97)  00 
00  if  a  <  0,  0  if  a  >  0 
2.4.36  (p.  97)  00    2.4.37  (p.  97)  1     2.4.38  (p.  97)  0    2.4.39  (p.  97)  0 

2.4.40  (p.  97)  0    2.4.41  (p.  97)  (b)  Suppose  that  g'  is  continuous  at  x0  and  / (x)  -- 

g(x)  if  x  <  x0,  f(x)=  1  +  g{x)  if  x  >  x0. 

2.4.44  (p.  97)  (a)  1     (b)  e     (c)  1     2.4.45  (p.  98)  eL 
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Section  2.5    pp.  107-112 

2.5.2  (p.  107)  /("+1)(jt0)/(«  +  l)!.  2.5.4  (p.  107)  (b)  Counterexample:  Letx0  =  0 
and  f(x)  =  x\x\. 

2.5.5  (p.  108)  (b)  Let  g(x)  =  1  +  \x  -  x0\,  so  f(x)  =  (x-  x0)(l  +  \x  -  x0\). 

2.5.6  (p.  108)  (b)  Let  g(x)  =  1  +  \x  -  x0\,  so  f(x)  =  (x-  x0)2(l  +  \x  -  x0\). 

2.5.10  (p.  109)  (b)  (i)  1,  2,  2,  0     (ii)  0,  -it,  3tt/2,  -An  +  jt3/2 

(iii)  -7t2/4,  -2n,  -6  +  n2/A,  4jv     (iv)  -2,  5,  -16,  65 

2.5.11  (p.  109)  (b)  0,-1,0,5 

2.5.12  (p.  110)  (b)  (i)  0,  1,  0,  5  (ii)  -1,  0,  6,  -24  (iii)  V2,  3^2,  llV2, 
57V2  (iv)  -1,  3,  -14,  88  (a)  min  (b)  neither  (c)  min  (d)  max  (e) 
min    (f)  neither    (g)  min    (h)  min 

2.5.14  (p.  110)  f(x)  =  e~1/x2  if  x  ^  0,  f(0)  =  0  (Exercise  2.5.1  (p.  107)) 

2.5.15  (p.  Ill)  None  if  b2  -  4c  <  0;  local  min  at  jcx  =  (-b  +  vV  -4c)/2  and  local 
max  at  x\  =  (—b  —  \lb2  —  Ac) / '2  if  b2  —  4c  >  0;  if  b2  =  4c  then  x  =  —b/2  is  a  critical 
point,  but  not  a  local  extreme  point. 

2.5.16  (p.  Ill)  (a)  -  (— V  (b)  1  (c)  (d) 

VH         '  K  '  6  V20/   v  '  83  v  '  512V2  4(63)4 

2.5.20  (p.  112)  (a)  M3h/3,  where  M3  =  sup|x„c|<A  |/(3)(c)| 
(b)  MAh2 /\2  where  M4  =  sup]x_c|<ft  |/(4)(c)| 

2.5.21  (p.  112)  k  =  -h/2 

Section  3.1    pp.  125-128 

3.1.8  (p.  126)  (b)  monotonic  functions  (c)  Let  [a,b]  =  [0,  1]  and  P  =  {0,  1}.  Let 
/(0)  =  /(l)  =  I  and  f(x)  =  x  if  0  <  x  <  1.  Then  ^(i5)  =  0  and  5(f)  =  1,  but  neither 
is  a  Riemann  sum  of  /  over  P . 

3.1.9  (p.  127)  (a)  i  -\  (b)  i,  1  3.1.10  (p.  127)  eb  -  ea  3.1.11  (p.  127) 
1  -  cos b    3.1.12  (p.  127)  sinft 

3.1.14  (p.  127)  f(a)[gl  -  g(a)]  +  f(d)(g2  -  gi)  +  f(b)[g(b)  -  g2] 

3.1.15  (p.  127)  f(a)[gl  -  g(a)]  +  f(b)[g(b)  -  gp]  +  ££=1  f(am)(gm+1  -  gm) 

3.1.16  (p.  127)  (a)  If  g  =  1  and  /  is  arbitrary,  then  /*  f(x)  dg{x)  =  0. 

Section  3.3    pp.  149-151 

3.3.7  (p.  150)  (a)  u  =  c  =  §     (b)  u  =  c  =  0    (c)  u  =  (e  -  2)/(e  -  1),  c  =  -Jfi 
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Section  3.4  pp.  165-171 
3.4.4  (p.  166) 


(a) 

(i) 

P>2 

(ii)  p  >  0 

(iii)  0 

(b) 

(i) 

P>2 

(ii)  p  >  0 

(iii)  0 

(c) 

(i) 

none 

(ii)  p  >  0 

(iii)  l/p 

(d) 

0; 

p  <  o 

(ii)  0  <  p  <  1 

(iii)  1/(1 

(e) 

(i) 

none 

(ii)  none 

3.4.5  (p.  166)  (a)  n\     (b)  ±     (c)  divergent     (d)  1     (e)  -1     (f)  0 

3.4.8  (p.  166)  (a)  divergent  (b)  convergent  (c)  divergent  (d)  convergent  (e) 
convergent     (f)  divergent 

3.4.9  (p.  166)  (a)  p  <  2  (b)  p  <  1  (c)  p  >  -1  (d)  -1  <  p  <  2  (e)  none 
(f )  none     (g)  p  <  1 

3.4.11  (p.  167)  (a)  p  -  q  <  1     (b)p,q<l     (c)  -1  <  p  <  2q  -  1  (d)  q  >  -1, 


+  q 

>  1      (e)  p  +  q  >  1      (f )  q  +  1 

3.4.12 

(p.  167)  deg 

?  ~  deg  /  >  2 

3.4.18 

(p.  168) 

(a) 

(i)  P  >  1 

(ii)  0  <    <  1 

(b) 

0)  />  >  1 

(ii)  P  <  1 

(c) 

(i)  p  >  1 

(ii)  0  <  p  <  1 

(d) 

(i)  P  >  0 

(ii)  none 

(e) 

(i)  1  <  /><  4 

(ii)  0  <  p  <  1 

(f) 

0)  f  >  I 

(ii)  0  <  p  <  | 

3.4.25 

(p.  169) 

(a) 

{i)p>-\ 

(ii)  -2  <  /?  <  -1 

(b) 

(i)  />  >  -1 

(ii)  none 

(c) 

(i)^<-l 

(ii)  none 

(d) 

(i)  none 

(ii)  none 

(e) 

(i)^<-l 

(ii)  P  >  1 

Section  4.1    pp.  192-195 

4.1.3  (p.  192)    (a)  2    (b)  1     (c)  0    4.1.4  (p.  192)  (a)  1/2    (b)  1/2  (c) 

1/2     (d)  1/2 

4.1.11  (p.  192)  (d)  VI    4.1.14  (p.  193)  (a)  1     (b)  1     (c)  1     (d)  -oo 

(e)0 

4.1.22  (p.  193)  If  s„  =  1  andf„  =  -l/«,then(limn^ooJn)/(lim„^oo^)  =  1/0  =  oo, 
but  linwoo  s„/t„  =  -oo. 

4.1.24  (p.  193)  (a)  oo,  0     (b)  oo,  -oo  if  \r\  >  1;  2,  -2ifr  =  -1;0,  Oifr  =  1;  1, 
—  lif|r|<l     (c)  oo,  —  cjo  if  r  <  —  1;  0,  Oif  \r\  <  1;  \,  ^  if  r  =  1;  oo,    oo  if  r  >  1 
(d)oo,oo  (e)\t\,-\t\ 
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4.1.25  (p.  194)  (a)  1,-1  (b)  2,-2  (c)  3,-1  (c)  VI/2, —v/3/2 
4.1.34  (p.  194)  (b)  lf{s„}  =  {1, 0,1,0,...},  then  linv,-«»  f„  =  \ 


Section  4.2    pp.  199  200 

4.2.2  (p.  199)  (a)  limm^oo  s2m  =  oo,  limm^oo  s2m+i  =  -oo 

(b)  limm^oo  S4m  =  1,  limm^oo  SAm+2  =  —  L  limm->oo  ^2m  +  l  =  0 

(c)  linim^oo  S2m  =  0,  limm^oo  SAm+\  =  L  limm^oo  *4m  +  3  =  —  1 

(d)  Hindoo  i„  =  0  (e)  limm^oo  s2m  =  oo,  limm^oo  s2m+i  =  0 

(f )  limm^oo  S%m  =  linim->cx)  $%m+2  =  L  limm->oo  ■S8m  +  1  =  \^2, 

limm^oo  58m+3  =  limm^oo  58m+7  =  0,  limm^oo  58m+5  =  _  V2, 
limm^oo  5gm+4  =  limm^oo  58m+6  =  —  1 

4.2.3  (p.  199)  {1,2, 1,2,3, 1,2,3,4, 1,2,3,4,5,...} 

4.2.8  (p.  200)  Let  {tn}  be  any  convergent  sequence  and  {s„}  =  {t\,l,t2,2, . . .  ,tn,n, . . . }. 


Section  4.3    pp.  228-234 

4.3.4  (p.  229)  (b)  No;  consider  £  1/n 

4.3.8  (p.  229)  (a)  convergent    (b)  convergent    (c)  divergent    (d)  divergent 
(e)  convergent    (f)  convergent    (g)  divergent    (h)  convergent 

4.3.10  (p.  229)  (a)  p  >  1    (b)  p  >  1    (c)  p  >  1 

4.3.15  (p.  230)  (a)  convergent    (b)  convergent  if  0  <  r  <  1,  divergent  if  r  >  1 
(c)  divergent  (d)  convergent  (e)  divergent  (f )  convergent 

4.3.17  (p.  231)  (a)  convergent    (b)  convergent    (c)  convergent    (d)  convergent 

4.3.18  (p.  231)  (a)  divergent  (b)  convergent  if  and  only  if  0  <  r  <  1  or  r  =  1  and 
p  <  —  1  (c)  convergent    (d)  convergent  (e)  convergent 

4.3.19  (p.  231)  (a)  divergent  (b)  convergent  (c)  convergent  (d)  convergent  if 
a  <  P  —  1 ,  divergent  if  a  >  fi  —  1 

4.3.20  (p.  231)  (a)  divergent    (b)  convergent    (c)  convergent    (d)  convergent 


(-1)"   |  1 


n  log« 


4.3.21  (p.  231)  (a)  £(-1)"    (b)  £(-l)n/".  £ 
(c)E(-l)"2"  (d)E(-i)" 

4.3.27  (p.  232)  (a)  conditionally  convergent  (b)  conditionally  convergent  (c)  abso- 
lutely convergent    (d)  absolutely  convergent 

4.3.28  (p.  232)  Let  k  and  s  be  the  degrees  of  the  numerator  and  denominator,  respec- 
tively. If  \r\  =  1,  the  series  converges  absolutely  if  and  only  if  s  >  k  +  2.  The  series 
converges  conditionally  if  s  =  k  +  1  and  r  =  —  1,  and  diverges  in  all  other  cases,  where 
s  >  k  +  1  and  \r\  =  1. 

4.3.30  (p.  232)  (b)  £(-l)"/V«     41  (p.  ??)  (a)  0     (b)  2A-a0 
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Section  4.4    pp.  253-256 

4.4.1  (p.  253)  (a)  F(x)  =  0,  |x|  <  1    (b)  F(x)  =  0,  |x|  <  1 
(c)  F(x)  =  0,  —  1  <  x  <  1     (d)  F(x)  =  sinx,  — oo  <  x  <  oo 

(e)  F(x)  =  1,  -1  <  x  <  1;  F(x)  =  0,  |x|  >  1     (f)  F(x)  =  x,  -oo  <  x  <  oo 
(g)  F(x)  =  jc2/2.  — oo  <     <  oo     (h)  F(x)  =  0.  — oo  <  x  <  oo 

(i)  F(x)  =  1,  — oo  <  x  <  oo 

4.4.5  (p.  254)  (a)  F(x)  =  0     (b)  F(x)  =  1,  |x|  <  1;  F(x)  =  0,  |jc|  >  1 

(c)  F(x)  =  sinx/x 

4.4.6  (p.  254)  (c)  Fn(x)  =  xn;  Sk  =  [-k/(k  +  \),k/(k  +  1)] 

4.4.7  (p.  254)  (a)  [-1, 1]  (b)  [-r,  r]  U  {1}  U  {-1},  0  <  r  <  1  (c)  [-r,  r]  U  {1},  0  < 
r  <  1 

(d)  [-r,r],  r  >  0     (e)  (-oo,-l/r]  U  [-r,r]  U  [l/r,oo)  U  {1},  0  <  r  <  1 

(f)  [-r,r],  r  >  0     (g)  [-r, r],  r  >  0     (h)  (-oo, -r]  U  [r, oo)  U  {0},  r  >  0 
(i)  [—  r,  r],  r  >  0 

4.4.12  (p.  254)  (b)  Let  5  =  (0, 1],  Fn(x)  =  sin(x/«),  G„(x)  =  l/x2;  then  F  =  0, 
G  =  l/x2,  and  the  convergence  is  uniform,  but  ||  FnGn || s  =  oo- 
4.4.14  (p.  255)  (a)  3     (b)  1     (c)  |     (d)  e  -  1 

4.4.17  (p.  255)  (a)  compact  subsets  of  (— |,oo)     (b)  [—5,00)  (c)  closed  sub- 

(l-V5  1  +  V5\      /JN  .  .  r  ,_N 

sets  of  I  — - — , — - —  I     (d)  (—00,00)  (e)  [r,  00),  r  >  1  (1)  compact  subsets 

of  (-oo,0)  U  (0,oo) 

4.4.19  (p.  255)  (a)  Let  5  =  (—00,00),  /„  =  a„  (constant),  where  ^an  converges 
conditionally, and  gn  =  \an\.    (b)  "absolutely" 

4.4.20  (p.  255)  (a)  (i)  means  that  ^  |  fn{x)\  converges  pointwise  and  ^  fn(x)  con- 
verges uniformly  on  5,  while  (ii)  means  that  £  |/«(^)|  converges  uniformly  on  5. 

00  2«  +  l  00  2«  +  l 

4.4.27  (p.  256)  (a)  Y(-l)n—   (b)  V(-l)"  

„^        n\{2n  +  1)     W  ^0V       (2n  +  l)(2n  +  1)! 

Section  4.5    pp.  275-280 

4.5.2  (p.  276)  (a)  l/3e     (b)  1     (c)  ±     (d)  1     (e)  00 

4.5.8  (p.  276)  (a)  1     (b)  \     (c)  \     (d)  4    (e)  l/e     (f)  1 

4.5.10  (p.  277)  x(l  +  x)/(l  -  x)3     4.5.12  (p.  277)  e"*2 

00  f_1)«-l 

4.5.16  (p.  277)  2     (x  -  1)";  ^  =  1 

00  2«  +  l 

4.5.17  (p.  277)Tan-1;c  =  ^(-1)"  ;  /(2n)(0)  =  0;  /(2"+1)(0)  =  (-1)2(2«)!; 
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6  V3      ^  (2n  +  1)3"+1/2 

°°     X2"  °°      Jt2"  +  1 

4.5.22  (p.  278)  coshx  =  V  ,  sinhjc  =  V  

yv         '  Z^(2«)  ^-£(2n  +  l) 

4.5.23  (p.  278)  (1  -  x)  J2%L0  x"  =  1  converges  for  all  x 

4.5.24  (p.  278)  (a)  x  +  x2  +  ^  -  ^  +  •  •  •    (b)  i -*  -  ^  +  ^!  +  ...  (c) 

1  1  1           (d)  x2  1  1  

2      24       720  v  '  2       6  6 

,     .  .  ,  .  2x  5x4  61xb 

4.5.27  (p.  279)  (a)  1  +  x  +  +  —  +  ■•■    (b)  1  -  x  +  +  •  •  •  (c)  1  +  —  +  +  +  ■  ■  ■ 

2        \  6      3        3  V  '       6      2        2  w        2       24  720 

(d)  H  1  1  1  (e)  2  —  x  -\  1  

v  '         6      360      15120         y  '  12  360 

5  3  ?  °° 

4.5.28  (p.  279)  FW  =     _         +       =1^  +  1^  =  ^P"+'  "  <-2)"+ V 

v  v  '  «=0 

4.5.29  (p.  279)  1 

Section  5.1    pp.  299-302 

5.1.1  (p.  299)  (a)  (3, 0,  3,  3)     (b)  (-1,-1,4)     (c)  (±,  {§,  §§,  A) 

5.1.3  (p.  299)  (a)  Vl5     (b)  V65/12     (c)  V3T     (d)  V3 

5.1.4  (p.  299)  (a)  V89     (b)  vT66/12     (c)  3     (d)  VJT 

5.1.5  (p.  299)  (a)  12     (b)  i     (c)  27 

5.1.7  (p.  299)  X  =  X0  +  fU  (-00  <  t  <  oo)  in  all  cases. 

5.1.8  (p.  299)  . . .  U  and  Xi  -  X0  are  scalar  multiples  of  V. 

5.1.9  (p.  299)  (a)  X  =  (1.-3, 4, 2)  +  t(l,  3,  -5,  3) 

(b)  X  =  (3, 1,-2, 1,4,)  +  f(-l,-l,  1,3,-7) 

(c)  X=  (1, 2, -l)  +  f (-1,-3,0) 

5.1.10  (p.  300)  (a)  5     (b)  2     (c)  1/2^5 

5.1.11  (p.  300)  (a)  (i)  Uxi,  x2,  X3,  x4)  |  |x,-|  <  3  (z  =  1,  2.  3)  with  at  least  one  equality} 
(ii)  {(xi,x2,x3,X4)  |  \xt  \  <  3  (i  =  1,2,3)}     (iii)  S 

(iv)  {(x\,  X2,  Xi,  X4)  I  \xi  I  >  3  for  at  least  one  of  i  =  1,  2,  3} 
(b)(i)S     (ii)  S    (iii)  0     (iv)  {(x,  y,  z)  \  z,  ^  1  or  x2  +  y2  >  1} 

5.1.12  (p.  300)  (a)  open  (b)  neither  (c)  closed 

5.1.18  (p.  300)  (a)  (n,  1,0)     (b)  (1,0,  e) 

5.1.19  (p.  300)  (a)  6     (b)  6     (c)  2y/5     (d)  2L^n     (e)  00 
5.1.29  (p.  302)  {(x,y)\x2  +  y2  =  1} 
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5.1.33  (p.  302)  ...  if  for  A  there  is  an  integer  R  such  that  |Xr  |  >  A  if  r  >  R. 


Section  5.2    pp.  314-316 

5.2.1  (p.  314)  (a)  10    (b)  3     (c)  1     (d)  0     (e)  0     (f)  0 

5.2.3  (p.  315)  (b)  a/{\  +  a2) 

5.2.4  (p.  315)  (a)  oo     (b)  oo     (c)  no     (d)  -co     (e)  no 

5.2.5  (p.  315)  (a)  0     (b)  0     (c)  none     (d)  0     (e)  none 

5.2.6  (p.  316)  (a)  ...if  Df  is  unbounded  and  for  each  M  there  is  an  R  such  that 
/(X)  >  M  if  X  €  Df  and  |X|  >  R.  (b)  Replace  ">  M"  by  "<  M"  in  (a). 

5.2.7  (p.  316)  limx^o  /  (X)  =  0  if  a\  +ci2-\  Yan  >  b;  no  limit  if  a  i  +a.2-\  \-an  < 

b  and  a2  +  a|  +  \-  a2  ^  0;  linix^o  / (X)  =  co  if  a\  =  ai  =  ■■■  =  an  =  0  and  b  >  0. 

5.2.8  (p.  316)  No;  for  example,  lirn^oo  g(x,  -Jx)  =  0. 

5.2.9  (p.  316)  (a)  R3  (b)  R2     (c)  R3     (d)  R2     (e)  {(x,  y)  \  x  >  y)     (f)  W 

5.2.10  (p.  316)  (a)  M3  -  {(0, 0, 0)}     (b)  R2     (c)  R2     (d)  R2     (e)  R2 

5.2.11  (p.  316)  f(x,  y)  =  xy/(x2  +  y2)  if  (x,  y)  ^  (0,  0)  and  /(0,  0)  =  0 


Section  5.3    pp.  335-339 

5.3.1  (p.  335)  (a)  ^-(x  +  y^x-xy  sinx)  -  2^|(x  cosx)  (b)  1  e~x+y2+2z 
2 

(c)  —  (xi  +  x2  +  ■  ■  ■  +  x„)  (d)  1/(1  +  x  +  y  +  z.) 
■s/n 

5.3.2  (p.  335)  (p\<p2     3  (p.  ??)  (a)  -5tt/V6     (b)  -2e     (c)  0     (d)  0 

5.3.5  (p.  335)  (a)  fx  =  fy  =  1/(jc  +  y  +  2z),  /z  =  2/(x  +  y  +  2z) 

(b)  /*  =  2x  +  3yz.  +  2y,  fy  =  3xz.  +  2x,  fz  =  3xy  (c)  /,  =        /,  =  xz.ey^, 
fz  =  xyeyz  (d)  fx  =  2xycosx2y,  fy  =  x2cosx2y,  fz  =  1 

5.3.6  (p.  335)  (a)  fxx  =  fyy  =  fxy  =  fyx  =  -l/(x  +  y+  2z.)2,  fxz  =  fzx  = 
fyz  =  fzy  =  -2/(x  +  y  +  2z)2,       =  -4/(x  +  y  +  2z,)2 

(b)  fXX    =  2,    fyy    =   fzz    =  0,  fXy    =   fyx    =  3 Z,  +  2,  fXZ    =   fZX    =  3 J ,  fyz    =    fZy    =  3X 

(c)  fxx  =  0,  fyy  =  xz2ey\  fzz  =  xy2ey\  fxy  =  fyx  =  z,ey\  fxz  =  fzx  =  ye**, 
fyz  =  fzy  =  xeyz 

(d)  fxx  =  2ycosx2y  -  Ax2y2  sinx2y,  fyy  =  -x4sinx2j,  fzz  =  0,  fxy  =  fyx  = 
2xcosx2y  -  2xly  sinx2j,  fxz  =  fzx  =  fyz  =  fzy  =  0 

5.3.7  (p.  336)  (a)  fxx(0,0)  =  fyy(0,0)  =  0,  fxy(0,0)  =  -1,  fyx(0,O)  =  1 
(b)  fxx(0, 0)  =  fyy(0, 0)  =  0,  fXy(0,  0)  =  -1,  fyx(0, 0)  =  1 

5.3.8  (p.  336)  f(x,  y)  =  g(x,  y)  +  h(y),  where  gxy  exists  everywhere  and  h  is  nowhere 
differentiable. 
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5.3.18  (p.  337)  (a)  df  =  (3x2  +  <\y2  +  2ysinx  +  2xycosx)  dx  +  (%xy  +  2xsmx)  dy, 
dXof  =  16  dx,(dXof)(X- X„)  =  16x 

(b)  df  =  —e~x~y~z  (dx  +  dy  +  dz),  dxQf  =  —dx  —  dy  —  dz, 
(dXof)(X-X0)  =  -x-y-z 

(c)  df  =  (1  +  xx  +  2x2  H  h  E"=i  7        d*of  =  E"=i  7  <£*7> 

(rfXo/)(X-X0)  =  E"=i7^. 

(d)  rf/  =  2r|X|2r-2E}=i  *y  ^7-  ^xo/  =  2rn'-1  £"=1  djcy, 
(^x0/)(X-Xo)  =  2r«'-1E"=i(^  -  1). 

5.3.19  (p.  337)  (b)  The  unit  vector  in  the  direction  of  (fxi  (X0),  fX2(X0), /x„(X0)) 
provided  that  this  is  not  0;  if  it  is  0,  then  3/(Xo)/3$  =  0  for  every  <I>. 

5.3.24  (p.  338)  (a)  z  =  lx  +  Ay-6  (b)  z  =  2x  +  3j  +  l  (c)  z.  =  {nx)/2+y-n/2 
(d)  z.  =  x  +  lOy  +  4 


Section  5.4    pp.  356-360 

5.4.2  (p.  357)  (a)  5du  +  34  dv     (b)  0     (c)  6du  -  18  dv  (d)8du 

5.4.3  (p.  357)  hr  =  fx  cos  9  +  fy  sin6»,  he  =  r(-fx  sin9  +  fy  cos  9),  hz  =  fz 

5.4.4  (p.  357)  hr  =  fx  sin</>  cos  9  +  fy  sin0  sin(9  +  fz  cos  4>,h$  =  r  sin</>(— /x  sin  9  + 
fy  cos  0),  11$  =  r(/x  cos  0  cos  9  +  fy  cos  0  sin  ^  —  fz  sin0) 

5.4.6  (p.  357)  hy  =  gxxy  +  gy  +  gwwy,  hz  =  gxxz  +  gz  +  gwwz 

5.4.13  (p.  358)  hrr  =  fxx  sin2  0  cos2  0  +  fyy  sin2  </>  sin2  9  +  fzz  cos2  (p+fxy  sin20sin26l  + 

fyz  sin  20  sin  9  +  fxz  sin  20  cos  9, 

r 

2' 


(— /c  sin#  +  /y  cos  0)  sin0  +  -{fyy  —  fxx)  sin2  <p  sin 20  +  r/Vy  sin2  (p  cos  20  + 


r 

-  (/Z3,  cos  6»  -  fzx  sin  6> )  sin  2<j> 

5.4.16  (p.  358)  (a)  1  +  x  +  —  -      +  f!  _  fZ! 
VH         y  w  2       2       6  2 

2  2  3  2  2  3 

,,  >  x  y      x      x  y     xy  y 

(b)  l-x-y  +  —  +  xy  +  -  

v;  /2/262  26 

(c)  0     (d)  xyz 

5.4.21  (p.  359)  (a)  (dl0  0)p)(x,  y)  =  {dfmq){x,  y)  =  2(x  -  y) 


2 


Section  6.1    pp.  376-378 


6.1.3  (p.  376)  (a) 


6.1.4  (p.  376)  (a) 


3  4  6 
2-4  2 
7      2  3 


(b) 


4 

-2 

-4 
1 


8      8    16  24 
0     0     4  12 
12    16    28  44 


(b) 


-2    -6  0 
0    -2  -4 
-2      2  -6 
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6.1.5  (p.  376)  (a) 

6.1.6  (p.  376)  (a) 


(b) 


29 
50 


-2  2  6 
6  7-3 
0-2  6 

13  25  " 

16  31  (b) 

16  25  _ 

6.1.10  (p.  377)  A  and  B  are  square  of  the  same  order. 


-1  7 
3  5 
5  14 


6.1.12  (p.  377)  (a) 

6.1.13  (p.  377) 


7  3  3 
4  7  7 
6    -9  1 


(b) 


-7    6  4 
-9    7  13 
5    0  -14 


14  10 
6  -2 
14  2 

-5  6  0 
4  -12  3 
4        0  3 


6.1.15  (p.  377)  (a)  [  6xyz    3xz2    3x2y  ];  [  -6    3    -3  ] 

(b)  cos(jc  +  y)[  1    1  ];  [  0    0  ] 

(c)  [  (1  -  jcz)je~xz    xe"xz    -jtV"*2  ];[  2    1    -2  ] 


(d)  sec20  +  2y  +  z.)  [  1  2  1  ];  [  2  4  2  ] 
(ejixr^xi    x2  ■■■ 


1  ] 


6.1.20  (p.  377)  (a)  (2,  3,  -2)     (b)  (2,  3, 0)     (c)  (-2, 0,-1)     (d)  (3, 1,  3, 2) 

-1       1  2 


6.1.21  (p.  378)  (a) 


1 

To 


(c) 


25 


(e)  - 


3 

-5 

-8 

5 

4 

10 

2 

0 

0 

1 

0 

0 

0 

2 

-3 

0 

1 

2 

4  2 
-3  1 


(d) 


(b) 


1 


3 
-1 

1 
1 

-1 


(f)  — 
v  '  10 


-1 
-14 
21 
17 


-2 
-18 
22 
24 


0 
10 

-10 
-10 


5 
20 
-25 
-25 


Section  6.2    pp.  390-394 
6.2.12  (p.  392)  (a)  F'(X)  = 

yz,exyz  xzex 
/F(X)  =  exyz  sin(;c  +  y  +  z.)[x(l  -  2x)(y  -  z)  -  z.{x  -  y)\, 


2x  1 
sin(x  +  y  +  z.)    -  sin(x  +  y  +  z.) 


sm(x  +  y  +  z.) 

xyexyz 
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"  0  " 

G(X)  = 

1 

+ 

1 

(b)  F'(X)  = 
G(X)  = 


2  1  2 

0  0  0 

0  0-1 

cos  y  —  ex  siny 
ex  sin  j  excos_y 


x  -  1 

y  +  i 

z 


/F(X) 


,2x. 


+ 


0 

1 


x 

y-n/2 


(c)  F'(X)  = 


2x    -2y  0 
0      2y  -2z 
-2x        0  2z 


/F  =  0; 


G(X) 


2 
0 

-2 


x-  1 
z-  1 


6.2.13  (p.  392)  (a)  F'(X)  = 
(b)  F'(X)  = 


(x  +  y  +  z  +  l)ex  ex 
(2x  -  x2  -  y2)e~x  2ye~x 


(c)  F'(r,  0)  = 


ex  sinjz 


ze  cosjz    ye  cos yz 


z.ey  cosxz.         smxz  xe^cosxz. 


y , 


ye^cosxy    xe^cosxy  smxy 
6.2.14  (p.  392)  (a)  F'(r,  0)  = 


cos  y  —  r  sin  I 
sin  0      r  cos  I 


;  /F(r,  9)  =  r 


COS  0  COS  0 

-r  sin  (9 

COS  0 

—r  cos  9  sin0 

(b)  ¥'(r,9,<f>)  = 

sin  (9  cos^ 

r  cos  9 

COS  (p 

— r  sin  (9  sin0 

sin^ 

0 

r  cos  0 

/F(r,  0, 0) 

=  r2  cos  (p 

cos  0    —  rsin£ 

>    0  " 

(c)  F'(r,0,z)  = 

sin  9       r  cos  ( 

9  0 

■  J¥(r, 

9,  z)  =  r 

0  0 

1 

6.2.20  (p.  393)  (a) 

fA\  r  4  -3  1 
<d)     0  11 


0 
0 

(e) 


0  4 


2  0 
2  0 


(b) 
(f) 


-18  0 
2  0 


(c) 


5 
9 

-4 


10 
18 
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Section  6.3    pp.  414-417 


6.3.4  (p.  414)  (a)  [1,tt/2]  (b)  [1,2tt]  (c)  [1,jt]  (d)  [2>/2,  9jt/4]  (e) 
[V2,  3tt/4] 

6.3.5  (p.  414)  (a)  [1,-3jt/2]  (b)  [1,  — 2jt]  (c)  [1,-jt]  (d)  [2V5, -7tt/4] 
(e)  [VX  -5jt/4] 

6.3.6  (p.  414)  (b)  Let  /(*)  =  x  (0  <  x  <  \),  f(x)  =  x  -  ~  (|  <  x  <  1);  then  /  is 
locally  invertible  but  not  invertible  on  [0,  1]. 

6.3.7  (p.  414)  F(S)  =  {(w,  u)  |  -  7r  +  20  <  arg(w,  u)  <  n  +  2<p),  where  4>  is  an  argu- 
ment of  (a,  b); 

¥^l(u,v)  =  (u2  +  v2)l'A 


cos(arg(w,  v)/2) 
sin(arg(w,  v)/2) 


6.3.10  (p.  415)  (a) 
(b) 


X 

l 

u  —  2v 

_  y  _ 

"  To 

3u  +  4v 

2(p  —  n  <  arg(w,  v)  <  2(p  +  jt 
1 


(F~  ) 


IV  - 


X 

1 

y 

~~  2 

z 

m  +  2u  +  3w 

u  —  w 
u  +  v  +  2w 

1 


6.3.12  (p.  415)  Gi  (»,«)  =  — 

v  2 


(F~iy 

V"  +  v 
*Ju  —  V 


1 


,G[(u,v)  = 


10 

1  2  3 
1  0  -1 
1    1  2 

1 


G2(M,  1))  = 

G3(w, «)  = 


1 

i 

l 


G4(m,  v)  =  — 
V2 


— «Ju  +  V 
*Ju  —  V 

V"  +  V 

— VM  —  v 

—  y/U  +  V 

— VM  —  v 


,  G'2(u,  v)  = 
,G'3(u,v)  = 
,  G'(u,  v)  = 


2^2 
1 


2V2 

—  1/VM  +  v  — 1/VM  +  v 
2^2  I     1/V"  -  v  -1/V"  -  v 

1  r  i/V"  +  t>  i/VM  +  ^ 

—  1/  VM    _   U  1  /  y/U    —  V 

— l/V"  +  «  — 1/V1'  + t> 

—  1/VM  —  U  1  /  -y/W  —  U 


1  -2 
3  4 


1  /  VM  +  ^     i/VM  +  u 

1/s/u  —  v    —  1/  V"  —  i> 


4V"""  2V2 

6.3.15  (p.  416)  From  solving  .t  =  rcos#,}>  =  r  sin  (9  for  #  =  arg(.r,;y).  Each  equation 
is  satisfied  by  angles  that  are  not  arguments  of  (x,  y),  since  none  of  the  formulas  identifies 
the  quadrant  of  (x,  y)  uniquely.  Moreover,  (c)  does  not  hold  if  x  =  0. 


6.3.16  (p.  416) 


x 

y 


=  G(u,v)  =  (u2  +  v2)1'4 


cos[i  arg(i/,  vy] 
sin(arg(w,  v)/2) 


where  /3  —  n/2  <  arg(w,  v)  <    +  it/2  and  /3  is  an  argument  of  (a,  b); 


1 


x  y 

-y  x 


G'(tt,u)  =  .  , 

2(x2  +  y2) 

6.3.19  (p.  416)  If  F(xi,x2, . . .  ,x„)  =  (x\ ,  x\, x%),  then  F  is  invertible,  but 
/F(0)  =0. 

6.3.20  (p.  416)  (a)  A(U)  = 


1  " 

1 

"55" 

u  +  5 

-1 

~  25 

3    8  _ 

v  —  4 
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(b)  A(U) 

(c)  A(U)  : 

(d)  A(U) 


1 
1 

0 
1 
1 

1 

7l/2 
71 


+ 


+ 


+ 


4  -2 

-3  3 

0  -1  1 
-1  1  0 

1  0  0 

0-1  0 

1  0  0 

0  0-1 


u  —  2 
v-3 

u  —  1 

V  -  1 

w  —  2 


u 

V  +  1 

w 


6.3.21  (p.  417)  G'(x,y,z)  = 


cos  9  cos 
sin  (9 


sin  9  cos  0 
cos  9 


0 


r  cos  0  r  cos  0 

1  1  1 

—  cos  9  sin  <fi  — sin  9  sin  <fi  -coscp 

r  r  r 


6.3.22  (p.  417)  G'(x,y,z)  = 


COS  I 

1 


sin  f 


0 


10  0 


1 


•  cos  9  0 


Section  6.4  pp.  431-434 
6.4.1  (p.  431)  (a) 


(b) 


it 
v 
w 


(d)  u  =  —x,  v  =  —y,  z  =  —i 


u 

1 

"  -3 

4  " 

X 

V 

2 

1 

-2 

3 

3  " 

-1 

2 

(c) 

u 

2 

3 

V 

-y  +  sin.T 
-jc  +  sin  y 


6.4.3  (p.  431)  /f(X,U)  =  I  y^  a,7(xj  -xy0)  I   -      -  ";o)s,  1  <  i  <  m,  where  r 

and  s  are  positive  integers  and  not  all  ay  =  0.  (a)  r  =  s  =  3;  (b)  r  =  1,  J  =  3;  (c) 
r  =  ^  =  2 

6.4.4  (p.  431)  11,(1, 1)  =  -f ,  11,(1, 1)  =  -\ 

6.4.5  (p.  431)  ux(l,  1, 1)  =  §,  «,(1, 1, 1)  =  -§,  uz(l,  1, 1)  =  \ 

6.4.6  (p.  431)  (a)  w(l,2)  =  0,  wx(l,  2)  =  uy(l,2)  =  -4 

(b)  u{-\,  -2)  =  2,  M,(-l,  -2)  =  1,  wy  (-1,  -2)  =  -i 

(c)  u(it/2,  it/2)  =  ux(jt/2,  n/2)  =  Uy{n/2,  Jt/2)  =  0 

(d)  u(l,  1)  =  1,  UX(1,  1)  =  Uy(l,  1)  =  -1 


6.4.7  (p.  431)  (a)  ui(l,  1)  =  1, 


9wi(l,  1) 
3x 


3wi(l,  1) 

3y 


562    Answers  to  Selected  Exercises 


9n2(l,l)  3M2(1,1) 
w2(l,  1)  =  2,   =  —14;   =  —2 


dx 


dy 


ru\      rn    ^      /"»/    ,  i\    /o     9Mfe(°.jr)      A  9w*(0,tt) 
(b)  w/t(0,  it)  =  (2k  +  l)7r/2,   =  0,   =  —1,    k  =  integer 


6.4.8  (p.  432) 


6.4.10  (p.  432)  - 


-1  -2  1 
-1    -2  1 

5  5 
-5  -5 

6  6 


dx  dy 

6.4.9  (p.  432)  u'(0)  =  3,  v'(0)  =  -1 


6.4.11  (p.  432)  Ui(l,  1) 


U2(l,l)  =  - 


,U'(1,1)  =  - 


u;(i,i) 

1  3 
-1  2 


1  3 
-1  2 


6.4.12  (p.  432)  ux(0, 0, 0)  =  2,  vx(0, 0, 0)  =  wx(0, 0, 0)  =  -2 


3(/.ff,A) 


6.4.13  (p.  433)  yx  =  -|t^4>  yv  = - 


3(f,g,h) 
_  d(y,v,u) 
Zv--d(f,g,h)'U> 
d(y,  z,  u) 

6.4.14  (p.  433)  x 


d(y,  z,  u) 
B(f,g,h) 
d(y,  z,  x) 
~d(J,g,h)-- 
d(y,  z,  u) 


d(f,g,h) 
d(v,  z,  u) 
d(f,g,h) 
d(y,  z,  u) 

Bjf.g.h) 
d(y,  z,  v) 
Hf^g.h) 
d(y,z,u) 


,zx  =  - 


Hf,g,h) 
d(y,  x,  u) 

mg,hy 

d(y,z,u) 


-2y  —  u,  z  =  —2d;  x  =  —2y  —  u,  v  =  —  — ;  y  =  - 

X      u  z 

Z.  =  —2v:  y  =  ,  v  =  — ;  z,  =  —2v.  u  =  —x  —  2y:  u  =  —x  —  2y,  v  = 

2     2  2 


6.4.15  (p.  433)  yx(l,  -1,-2)  =  -1,  -2)  =  1 

6.4.16  (p.  433)  uw(0,-l)  =  |,Mj,(0,-l)  =  0,  vw(0,-l)  =  -§,  vy(0,-l)  =  0, 
x„(0,-l)  =  l,*y(0I-l)  =  -l 

6.4.18  (p.  434)  ux(l,  1)  =  0,  uy(l,  1)  =  0,  vx(l,  1)  =  -1,  vy(l,  1)  =  -1,  uxx(l,  1) 
2, 

u„(l,  1)  =  1,  uyy(l,  1)  =  2,  1)  =  -2,  vxy{\,  1)  =  -1,  ww(l,  1)  =  -2 

6.4.19  (p.  434)         -1)  =  0,  uy(l,  -1)  =  ^  «,(1,  -1)  =  ~,  vy(l,  -1)  =  0, 

UXX{\,-\)  =  -~,  M,y(l,-1)  =   -,  Kj,y(l,-1)  =   -,  D^(l,-1)  =  --, 
1)^(1,-1)  =  ~,  Vyy(l,  -1)  =  - 
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Section  7.1    pp.  459  462 

7.1.2  (p.  459)  (a)  28    (b)  \     6  (p.  ??)  3(b  -  a)(d  -  c),  0     13  (p.  ??) 

{(m,  n)\m,n  =  integers} 

Section  7.2    pp.  480-484 

7.2.1  (p.  480)  (a)  12     (b)  g>     (c)  -1     (d)  (1  -  log2)/2 

7.2.5  (p.  481)  (a)  2     (b)  17     (c)  §  1)     (d)  1/4* 

7.2.7  (p.  481)  (a)  |,  f  (b)  f ,  f  7.2.8  (p.  482)  (a)  f ,  f  (b)  §  (z  +  ±), 
§(z  +  2)    (c)z  +  il 

7.2.11  (p.  482)  (a)  -285     (b)  0     (c)  0     (d)  \(e  -  f ) 

7.2.12  (p.  483)  (a)  324     (b)  2     (c)  1     7.2.13  (p.  483)  f| 
7.2.14  (p.  483)  (a)  36     (b)  1     (c)  M     (d)  (e6  +  17)/2 

7.2.17  (p.  483)  (a)  £     (b)  \{e  -  f )     (c)  ±     (d)  ^ 

7.2.18  (p.  483)  (a)  16*     (b)  ±     (c)  4f     (d)  f 

7.2.19  (p.  484)  (a)        -ci)  •••  (bn  -  a„)  E"=i(«y  +  bj) 

(b)  -  at)  ■  ■  ■  (bn  -  an)  E"=i  (flj  +  « A'  +  ^2) 

(c)  2-"{b\-a\)---{b2n-a2n) 

7.2.20  (p.  484)  /_^2  dx  ffi**  f(x,  y)  dy     7.2.22  (p.  484)  \ 
Section  7.3    pp.  514-517 

7.3.1  (p.  514)  Let  Si  and  S2  be  dense  subsets  of  R  such  that  5i  U  52  =  K. 
7.3.7  (p.  514)  (a)  -1;  c  (constant);  1     9  (p.  ??)  (w2  -  ui)(v2  -  Vi)/\ad  -  bc\ 
7.3.10  (p.  515)  f     7.3.14  (p.  515)  (a)  |     (b)  logf     7.3.15  (p.  516)  3 
7.3.16  (p.  516)  2    7.3.17  (p.  516)  f  e(e  -  1) 

7.3.18  (p.  516)  \nabc     7.3.19  (p.  516)  2n(e25  -  e9)     7.3.20  (p.  516)  16tt/3 

7.3.21  (p.  516)  21/64 

7.3.22  (p.  516)  (a)  (jr/8)log5     (b)  (*/4)(e4  -  1)     (c)  2^/15 

7.3.23  (p.  517)  it2a4/2 

7.3.24  (p.  517)  (a)  (ft         •••(&,  -or*)/|det(A)|     25  (p.  ??)  \axa2  ■  ■  -an\V„ 
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A 

Abel's  test,  219 

Abel's  theorem,  273,  279 

Absolute  convergence,  215 

of  an  improper  integral,  160 

of  a  series  of  constants,  215 

of  a  series  of  functions,  247 
Absolute  integrability,  160 
Absolute  uniform  convergence,  247,  255 
(Exercises  4.4.17  and  4.4.20), 
256  (Exercise  4.4.21) 

of  a  power  series,  257 
Absolute  value,  2 
Addition  of  power  series,  267 
Adjoint  matrix,  370 
Affine  transformation,  380 
Alternating  series,  203 

test,  203,219 
Analytic  transformation,  4 1 6  (Exercise  6.3.17) 
Angle  between  two  vectors,  286 
Antiderivative,  143,  150  (Exercise  3.3.16) 
Archimedean  property,  5 
Area  under  a  curve,  116 
Argument,  398 

branch  of,  409, 410, 415  (Exercise  6.3.14) 
Ascoli-Arzela  theorem,  541 
Associative  laws 

for  the  real  numbers,  2  (see  p.  1) 

for  vector  addition,  283 

B 

Bessel  function,  277  (Exercise  4.5.11) 


Binomial  coefficient,  17  (Exercise  1.2.19), 
102,  194  (Exercise  4.1.35) 

Binomial  series,  266 

Binomial  theorem,  17  (Exercise  1.2.19) 

Bolzano-Weierstrass  theorem,  27,  294, 
301  (Exercise  5.1.22) 

Bound 

lower,  7 

upper,  3 
Boundary,  526 

point,  289,  526 

of  a  set,  23,  289 
Bounded  convergence  theorem,  243 
Bounded  function,  47,  60,  313 
Boundedness  of  a  continuous  function 

on  a  closed  interval,  62,  199 

on  a  compact  set,  313 
Boundedness  of  an  integrable  function, 
119 

on  a  metric  space,  536 
Bounded  sequence,  181,  197,  292 
Bounded  set 

above,  3,  313 

below  7,  313 
Bounded  variation,  134—135  (Exercises  3.2.7, 
3.2.9,3.2.10) 

Branch 

of  an  argument,  409,  415 
of  an  inverse,  409 

C 

C[a,b],521 

equicontinuous  subset  of,  539 
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uniformly  bounded  subset  of,  539 
Cartesian  product,  31,  435 
Cauchy  product  of  series,  226,  233  (Ex- 
ercise 4.3.40),  280  (Exercise  4.5.32) 
Cauchy  sequence,  527 
Cauchy's  convergence  criterion 

for  sequences  of  real  numbers,  190 

for  sequences  of  vectors,  292 

for  series  of  real  numbers,  204 
Cauchy's  root  test,  215 
Cauchy's  uniform  convergence  criterion 

for  sequences,  239 

for  series,  246 
Chain  rule,  77,  340,  388 
Change  of  variable,  145,  147 

in  an  improper  integral,  164 

in  a  multiple  integral,  496 

formulation  of  the  rule  for,  494 

in  an  ordinary  integral,  145,  147 
Changing  the  order  of  integration,  478 
Characteristic  function,  70  (Exercise  2.2.9), 
485 

Closed 

under  scalar  multiplication,  519 

under  vector  addition,  519 
Closed  interval,  23 
Closed  n  -ball,  291 
Closed  set,  21,  289,  525 
Closure  of  a  set,  23,  289 
Cofactor,  370 

expanding  a  determinant  in,  371-372 
Commutative  laws 

for  the  reals,  2  (See  p.  1) 

for  vector  addition,  283 
Compact  set,  20,  293,  536 
Comparison  test 

for  improper  integrals,  156 

for  series,  206 
Complement  of  a  set,  20 
Complete  metric  space,  527 
Completeness  axiom,  4 
Complete  ordered  field,  4 
Component  function,  311 
Components,  284  (see  p.  281) 

of  a  vector- valued  function,  311,  362 


Composite  function,  58,  311 

continuity  of,  59,  311 

differentiability  of,  77,  340 

higher  derivatives  of,  345 

Taylor  polynomial  of,  109-1 10 
(Exercise  2.5.1 1) 
Composition  of  functions,  58 
Conditional  convergence 

of  an  improper  integral,  162 

of  a  series,  217 
Conditionally  integrable,  162 
Connected  metric  space,  547  (Exercise  8.3.2) 
Connected  set,  295 

polygonally,  296 
Containment  of  a  set,  19 
Content,  453 

of  a  coordinate  rectangle,  437 

of  a  set,  485 

zero,  448,  514  (Exercise  refexer:7.3.2) 
Continuity,  54,  302 

of  a  composite  function,  59,  311 

of  a  differentiable  function,  76,  325 

of  a  function  of  n  variables,  309 

of  a  function  of  one  variable,  54 

on  an  interval,  55 

from  the  left,  54 

of  a  monotonic  function,  67 

piecewise,  56 

from  the  right,  54 

on  a  set,  56,  311 

of  a  sum,  difference,  product,  and 
quotient,  57,  311 

in  terms  of  sequences,  198 

of  a  transformation,  379 

uniform,  64, 66,  314,  392  (Exercise  6.2.10) 

of  a  uniform  limit,  242 

of  a  uniformly  convergent  series,  250 
Continuous  function  54,  309 

boundedness  of,  62,  313 

extreme  values  of  on  a  closed  inter- 
val, 62 

integrability  of,  133 

intermediate  values  of,  63,  313 

on  a  metric  space,  543 
Continuous  transformation,  379 
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Continuously  differentiable,  73,  80,  329, 
385,  409 

Contraction  mapping  theorem,  545 
Convergence 
absolute 

of  an  improper  integral,  1 60 

of  a  series  of  constants,  215 
absolute  uniform,  247 
conditional 

of  a  series,  217 

of  an  improper  integral,  162 
of  an  improper  integral,  152 
of  an  infinite  series,  201 
interval  of,  258 
pointwise 

of  a  sequence  of  functions,  234, 
238 

of  a  series  of  functions,  244 
of  a  power  series,  257 
radius  of,  258 

of  a  sequence  in  a  metric  space,  526 

of  a  sequence  in  W ,  292 

of  a  sequence  of  real  numbers,  179 

of  a  series  of  constants,  200 

of  a  sum,  difference,  or  product  of 
sequences,  184 

of  a  Taylor  series,  264 

uniform,  246 

of  a  sequence,  237 
of  a  series,  246 
Coordinate  cube,  437 

degenerate,  437 

nondegenerate,  437 
Coordinate  rectangle,  437 
Coordinates, 

polar,  397,  502,  505 

spherical,  507 
Covering,  open,  25,  293,  535 
Cramer's  rule,  373 
Critical  point,  81,335 
Curve,  differentiable,  453 

D 

Decreasing  sequence,  182 
Dedekind  cut,  9  (Exercise  1.1.8) 


Dedekind's  theorem,  9  (Exercise  1.1.8) 

Defined  inductively,  12 

Degree 

of  a  homogeneous  polynomial,  352 

of  a  polynomial,  98 
Deleted  € -neighborhood,  22 
Deleted  neighborhood,  525 
Dense  set,  6,  29  (Exercise  1.3.22),  70  (Ex- 
ercise 2.2.10) 
Density  of  the  rationals,  6,  392  (Esercise  6.2.1 1) 
Density  of  the  irrationals,  6 
Denumerable  set,  176 
Derivative,  73 

of  a  composite  function,  77 

directional,  317 

infinite,  88  (Exercise  2.3.26) 

of  an  inverse  function,  86  (Exercise  2.3.14) 

left-hand,  79 

nth,  73 

one-sided,  79 

ordinary,  317 

partial,  317 

of  a  power  series,  261-262 
right-hand,  79 
rth  order,  319 
second,  73 

of  a  sum,  difference,  product,  and 
quotient,  77 

zeroth,  73 
Determinant,  368  (see  p.  369) 

expanding  in  cofactors,  371-372 

of  a  product  of  square  matrices,  370 
Diameter  of  a  set,  292,  586 
Difference  quotient,  73 
Differentiability 

of  a  composite  function,  340 

continuous,  329 

of  a  function  of  one  variable,  73 
of  a  function  of  several  variables,  323 
of  the  limit  of  a  sequence,  243 
of  a  power  series,  260-262 
of  a  series,  252 
Differentiable  73,  323 

continuously,  73,  80,  409 
curve,  453 
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function,  continuity  of,  76,  325,  385 
on  an  interval,  80 
on  a  set,  73 
surface,  453 
transformation,  380 
vector-valued  function,  339 
Differential,  326 
higher,  348 

of  a  linear  transformation,  367 

matrix,  367,  381 

of  a  real-valued  function,  326 

of  a  sum,  difference,  product,  and 

quotient,  328 
of  a  transformation,  381 
Differential  equation,  170-171 

(Exercises  3.4.27-3.4.29) 
Directional  derivative,  317 
Dirichlet's  test 

for  improper  integrals,  163 
for  series  of  constants,  217 
for  uniform  convergence  of  series, 
248 

Disconnected  set,  295 
Discontinuity 

jump,  56 

removable,  58 
Discrete  metric,  519 
Disjoint  sets,  20 
Distance 

in  a  metric  space,  518 

from  a  point  to  a  set,  301 
(Exercise  5.1.24) 

between  subsets  of  a  metric  space, 
547  (Exercise  8.3.3) 

between  two  sets,  301 
(Exercise  5.1.25) 

between  two  vectors,  283 
Distributive  law,  2  (see  p.  1) 
Divergence,  unconditional,  233 

(Exercise  4.3.38) 
Divergent  improper  integral,  152 
Divergent  sequence,  179 
Divergent  series,  201 
Domain  of  a  function,  31  (see  p.  30),  543 
Double  integral,  438 


E 

Edge  lengths  of  a  coordinate  rectangle, 

437 

Elementary  matrix,  488 
Empty  set,  4 
Entries  of  a  matrix,  364 
e-neighborhood,  21,  289,  525 
e-net,  538 

Equicontinuous  subset  of  C[a,  b],  539 

Equivalent  metrics,  529 

Error  in  approximating  derivatives,  112 

(Exercises  112-112) 
Euclidean  n -space,  282  (see  p.  281) 
Euler's  constant,  230  (Exercise  4.3.14) 
Euler's  theorem,  357-358  (Exercise  2.4.8) 
Existence  of  an  improper  integral,  152 
Existence  theorem,  420 
Expanding  a  determinant,  362-372 
Exponential  function,  70  (Exercise  2.2. 12), 

72  (Exercise  2.2.33),  228,  273 
Extended  mean  value  theorem,  106 
Extended  reals,  7, 
Exterior  point,  289,  526 
Exterior  of  a  set,  23,  289,  526 

F 

Faa  di  Bruno's  formula,  109 
(Exercise  2.5.1 1) 
Fibonnacci  numbers,  17  (Exercise  1.2.17) 
Field 

complete  ordered,  4 

ordered, 2 

properties,  2  (see  p.  1) 
Finite  real,  7 

First  mean  value  theorem  for  integrals, 
139 

Forward  differences,  104,71  (Example  2.2.18), 

112  (Exercises  2.5.19-2.5.22) 
Fredholm's  integral  equation,  546 
Function  31,  32 

absolutely  integrable,  160 
Bessel,  277  (Exercise  277) 
bounded,  47,  60,313 
above,  60,  313 
below,  60,  313 
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of  bounded  variation,  134  (Exercise  3.2.7) 
characteristic,  70  (Exercise  2.2.9),  485 
composite,  58,  311 
decreasing,  44 

differentiable  at  a  point,  73,  323 
domain  of,  31,  32 

exponential,  70  (Exercise  2.2.12),  72 

(Exercise  2.2.33),  227,  273 
generating,  278  (Exercise  4.5.26) 
homogeneous,  357  (Exercise  5.4.8) 
increasing,  44 
infimum  of,  55,  313 
inverse  of,  68 
linear,  325 

locally  integrable,  152 
maximum  of,  60 
monotonic,  44,  67 
nondecreasing,  44 
nonincreasing,  44 
nonoscillatory  at  a  point,  162 
Hth  power  of,  33 
oscillation  of,  171 
piecewise  continuous,  56 
range  of,  31,  32 

rational,  33,  232,  (Exercise  4.3.28), 
276  (Exercise  4.5.4) 

real-valued,  302 

restriction  of,  399 

Riemann  integrable,  1 14,  438 

Riemann-Stieltjes  integrable,  125 

strictly  monotonic,  44 

supremum  of,  3 1 3 

value  of,  31,  32 

vector- valued,  311 
Functions, 

composition  of,  58,  311 

difference  of,  32 

product  of,  32 

quotient  of,  32 

sum  of,  32 
Fundamental  theorem  of  calculus,  143 


G 

Generalized  mean  value  theorem,  83 


Generating  function,  278  (Exercise  4.5.26) 
Geometric  series,  202 
Grouping  terms  of  series,  220 

H 

Heine-Borel  property, 

Heine-Borel  theorem,  172,  66,  172,  293 

Higher  derivatives  of  a  composite  func- 
tion, 345 

Higher  differential,  348 

Homogeneous  function,  357  (Exercise  5.4.8), 
359  (Exercise  5.4.23) 

Homogeneous  polynomial,  359  (Exercise  5.4. 

Homogeneous  system,  375 

Hypercube,  295  (see  p.  294) 

Holder's  inequality,  521 

I 

Identity  matrix,  370 
Image,  394 

Implicit  function  theorem,  420,  423 
Improper  integrability,  146 
Improper  integral,  152 

absolutely  convergent,  160 

change  of  variable  in,  164 

conditionally  convergent,  162 

convergence  of,  152 

divergence  of,  152 

existence  of,  152 

of  a  nonnegative  function,  156 
Incompleteness  of  the  rationals,  6 
Increasing  sequence,  182 
Indeterminate  forms,  91,  93-95 
Induction  assumption,  12 
Induction  proof,  12 
Inequality, 

Holder,  521 

Minkowski,  522 

Schwarz,  284 

triangle,  2,  285 
Infimum 

of  a  function,  60,  313 

of  a  set,  7 

existence  and  uniqueness  of,  7,  9 
(Exercise  1.1.6) 
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Infinite  derivative,  88  (Exercise  2.3.26) 
Infinite  limits,  42,  306,  317,  316(Exercise  5.2.6) 
Infinite  sequence,  179 

in  a  metric  space,  526 
Infinite  series,  210,  244 

convergence  of,  201 

integrability  of,  25 1 

oscillatory,  201 
Infinity  norm,  496,  523,  524 
Inner  product,  284 
Instantaneous 

rate  of  change,  74 

velocity,  74 
Integrability 

conditional,  162 

of  a  continuous  function,  133 

of  a  function  of  bounded  variation, 
134  (Exercise  3.2.7) 

improper,  152 

of  an  infinite  series,  25 1 

local,  152 

of  a  mono  tonic  function,  133 

of  a  power  series,  264 
Integrable 

Riemann,  114,438 

Riemann-Stieltjes,  125 
Integral 

over  an  arbitrary  set  in  R",  452 
of  a  constant  times  a  function,  136, 

456 
double,  439 
improper,  151 
iterated,  462 
lower 

for  Riemann  integral,  1 20,  442 
for  Riemann-Stieltjes  integral  128 
(Exercise  3.1.17) 

multiple,  439 

ordinary,  439 

of  a  product,  138,456 

proper,  153 

over  a  rectangle  in  W ,  436  (See  p.  435) 
Riemann,  1 14,  438 


Riemann-Stieltjes,  125,  127  (Exer- 
cise 3.1.16),  135  (Exercises  3.2.8- 
3.2.10),  151  (Exercise  3.3.23) 

over  subsets  of  W ,  436  (See  p.  435), 
450,452,471-472 

of  a  sum,  136, 456 

test,  207 

triple,  439 
Integration  by  parts,  144 

for  Riemann-Stieltjes  integrals,  135 
(Exercise  3.2.8) 
Interior  of  a  set,  21,  289 
Interior  point,  21,  289,  525 
Intermediate  value  theorem 

for  continuous  functions,  63,  313 

for  derivatives  82 
Intersection  of  sets,  20 
Interval 

closed,  23 

half  closed,  23 

half  open,  23 

open,  21 

semi-infinite,  21,  23 
Interval  of  convergence,  258 

for  derivatives,  82 
Inverse  function,  68 

branch  of,  409 

derivative  of,  86  (Exercise.  2.3.14) 

of  a  function  restricted  to  a  set,  399 

of  a  matrix,  370 

of  a  transformation,  396 
Inverse  function  theorem,  412 
Invertible,  locally,  400 
Invertible  transformation,  396 
Irrational  number,  6 
Isolated  point,  23,  289,  526 
Iterated  integral,  462 
Iterated  logarithm,  97  (Example  2.4.42), 
167  (Exercise  3.4.10),  208  230 
(Exercise  4.3.1 1),  230  (Exercise  4.3 

J 

Jacobian,  384,  426 
Jordan  content,  485 

changed  by  linear  transformation,  488 
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Jordan  measurable  set,  485,  488 
Jump  discontinuity,  56 

L 

Lebesgue  measure  zero,  175,  177  (Exer- 
cises 3.5.7,  3.5.8) 
Lebesgue's  existence  criterion,  176 
Left  limit  inferior,  47 
Left  limit  superior,  47 
Left-hand  derivative,  79 
Left-hand  limit,  38 

Legendre  polynomial,  278  (Exercise  4.5.27) 

Leibniz's  rule,  86,  (Exercise.  2.3.12) 

Length  of  a  vector,  283 

1' Hospital's  rule,  88 

Limit  of  a  real-valued  function,  302 

Limit 

along  a  curve,  315  (Exercise  5.2.3) 
in  the  extended  reals,  43 
inferior  of  a  sequence,  188 
left,  47 

infinite,  42,  306,  316  (Exercise  5.2.6) 

at  infinity,  307,  316  (Exercise  5.2.6) 

left-hand,  38 

one-sided,  37,  40 

point,  23,  289,  526 

pointwise,  234,  238,  244 

at  ±oo,  40 

of  a  real-valued  function 
as  x  approaches  xq,  34 
as  x  approaches  oo,  40 
as  x  approaches  — oo,  50  (Exer- 
cise 2.1.14) 

right-hand,  39 

of  a  sequence,  179,  292 

uniqueness  of,  35,  305 

of  a  sum,  product,  or  quotient,  35, 
305 

superior,  left,  47 

superior  of  a  sequence,  188 

uniform,  237 

uniqueness  of,  35,  305 
Line  segments  inR",  288 
Line,  parametric  representation  of,  288- 
289 


Linear  function,  325 
Linear  transformation,  362 

change  of  content  under,  490 

differential  of,  367 

matrix  of,  363 
Lipschitz  condition,  84,  87  (Exercise  2.3. 
140 

Local  extreme  point,  80,  334 
Local  extreme  value,  80 
Local  integrability,  152 
Local  maximum  point,  80,  334 
Local  minimum  point,  80,  334 
Locally  invertible,  400 
Lower  bound,  7 
Lower  integral,  1 20,  442 
Lower  sum,  120,  442 

M 

Maclaurin's  series,  264 
Magnitude,  2 

Main  diagonal  of  a  matrix,  370 
Mathematical  induction,  10,  13 
Matrices 

product  of,  364 

sum  of,  364 
Matrix 

adjoint,  370 

of  a  composition  of  linear  transfor- 
mations, 366 
differential,  367,  381 
elementary,  488 
identity,  370 
inverse,  370 

of  a  linear  transformation,  363 

main  diagonal  of,  370 

nonsingular,  370 

norm  of,  368 

scalar  multiple  of,  364 

singular,  370 

square,  368  (See  p.  369) 

transpose  of,  370 
Maximum  value,  local,  80 
Maximum  of  a  function,  60 
Mean  value  theorem,  83,  347 

extended,  106 
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generalized,  83 

for  integrals,  138,  144 
Metric,  518 

discrete,  519 

induced  by  a  norm,  520 
Metrics,  equivalent,  529 
Metric  space,  518 

complete,  527 

connected,  547  (Exercise  8.3.2) 
Minimum  of  a  function,  60 
Minimum  value,  local,  80 
Minkowski's  inequality,  522 
Monotonic  function,  44,  67,  84 

integrability  of,  133 
Monotonic  sequence,  182 
Multinomial  coefficient,  322,  336  (Exer- 
cise 5.3.12) 
Multiple  integral,  439 
Multiplication 

of  matrices,  364 

of  series,  223 

scalar,  519 

Multiplicity  of  a  zero,  87  (Exercise.  2.3.21), 
108  (Exercises  2.5.5-2.5.7) 

N 

Natural  numbers,  10 
n-ball,  290-291 

Negative  definite  polynomial,  353 
Negative  semidefinite  polynomial,  353 
Neighborhood,  21,  289,  525 

deleted,  22,  525 

deleted  e,  22 

e,  21 

Nested  sets,  292,  529 

principle  of,  292,  529 
Nondecreasing  sequence,  182 
Nondegenerate  coordinate  cube,  437 
Nondenumerable  set,  176 
Nonempty  set,  4 
Nonincreasing  sequence,  182 
Nonoscillatory  at  a  point,  162 
Nonsingular  matrix,  370 
Nontrivial  solution,  375 
Norm 


infinity,  496,  523,  524 

of  a  matrix,  368 

metric  induced  by,  520 

of  a  partition,  114,  437 

on  a  vector  space,  519 
Normed  vector  space,  519 
«th  derivative,  73 
Hth  partial  sum  of  a  series,  201 
«th  term  of  a  series,  201 
Number,  natural,  10 
Number,  prime,  1 5 

o 

One-sided  derivative,  79 
One-sided  limit,  37 
One-to-one  transformation,  396 
Open  ball,  525 
Open  covering,  25,  293,  535 
Open  interval,  21 
Open  M-ball,  290 
Open  set,  21,289,  525 
Ordered  field,  2 

complete,  4 
Order  relation,  2 
Ordinary  derivative,  317 
Ordinary  integral,  439 
Origin  of  E" ,  283 
Oscillation  of  a  function,  171 

at  a  point,  172 
Oscillatory  infinite  series,  201 

P 

Parametric  representation  of  a  line,  288, 
289 

Partial  derivative,  317 

rth  order,  319 
Partial  sums,  244 
Partition,  114,  437 

norm  of,  1 14,  437 

points,  114 

refinement  of,  1 14,  438 
Path,  polygonal,  296 
Peano's  postulates,  10-11 
Piecewise  continuous  function,  56 
Point,  19 
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boundary,  23,  289,  526 

critical,  81,  335 

exterior,  23,  289,  526 

at  infinity,  7 

interior,  21,  289 

isolated,  23,  289,  526 

limit,  23,  289,  526 

local  extreme,  80,  334 

local  maximum,  80,  334 

local  minimum,  80,  334 

in  terms  of  sequences,  197 
Pointwise  convergence 

of  a  sequence  of  functions,  234,  238 

of  a  series,  244 
Pointwise  limit,  234,  238,  244 
Polar  coordinates,  397,  502,  505 
Polygonal  path,  296 
Polygonally  connected,  296 
Polynomial,  33,  98 

homogeneous,  352 

negative  definite,  353 

negative  semidefinite,  353 

positive  definite,  353 

positive  semidefinite,  353 

semidefinite,  353 

Taylor,  99,  351 
Power  series,  257 

arithmetic  operations  with,  267 

continuity  of,  260-26 1 

convergence  of,  257 

differentiability  of,  260-261 

integration  of,  264 

of  a  product,  268 

of  a  reciprocal,  27 1 

of  a  quotient,  269 

uniqueness  of,  263 
Prime,  15 

Principal  value,  155 

Principle  of  mathematical  induction,  1 1 , 
14 

Principle  of  nested  sets,  529 

Product 

Cartesian,  31,  436  (see  p.  435) 
Cauchy,  226,  233  (Example  4.3.40) 
inner,  284 


of  matrices,  364 
of  power  series,  268 
of  series,  223 
Proper  integral,  153 

R 

W,  282  (see  p.  281) 

rfh  order  partial  derivative,  319 

Raabe's  test,  212 

Radius  of  convergence,  258 

Range  of  a  function,  31,  32,  543 

Ratio  of  a  geometric  series,  202 

Ratio  test,  210 

Rational  function,  33,  232  (Exercise  4.3.28), 

276  (Exercise  4.5.4) 
Rational  numbers,  2 

density  of,  6 

incompleteness  of,  6 
Real  line,  19 
Real  number  system,  19 
Real-valued  function, 

of  n  variables,  302 

of  a  real  variable,  3 1 
Reals,  extended,  7 
Rearrangement  of  series,  221 
Rectangle,  coordinate,  437 
Refinement  of  a  partition,  1 14,  438 
Region,  295,  297 
Region  of  integration,  476 
Regular  transformation,  405 
Remainder  in  Taylor's  formula,  405 
Removable  discontinuity,  58 
Restriction  of  a  function,  399 
Riemann  integrable,  1 14,  438 
Riemann  integral  1 14  (see  p.  113),  438 

uniqueness  of,  125  (Exercise  3.1.1) 
Riemann  sum,  1 14,  438 
Riemann-Stieltjes  integral,  125 

integration  by  parts  for,  135  (Exer- 
cise 3.2.8) 
Riemann-Stieltjes  sum,  125 
Right  limit  inferior,  53  (Exercise  2.1.39) 
Right  limit  superior,  53  (Exercise  2.1.39) 
Right-hand  derivative,  79 
Right-hand  limit,  39 
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Rolle's  theorem,  82 


Scalar  multiple,  282 
Scalar  multiplication,  519 
Schwarz's  inequality,  284 
Secant  plane,  332-333 
Second  derivative,  73 
Second  derivative  test,  103 
Second  mean  value  theorem  for  integrals, 
144 

Sequence,  179,  526 
bounded,  181,  292 
bounded  above,  181 
bounded  below,  181 
Cauchy,  527 

convergence  of,  179,  292,  526 
decreasing,  182 
divergent,  179 

to  ±oo,  181 
of  functional  values,  183 
of  functions, 

pointwise,  234 
increasing,  182 
limit  of,  179,  292 

uniform,  237 
limit  inferior  of,  188 
limit  superior  of,  188 
monotonic,  182 
nondecreasing,  182 
nonincreasing,  182 
Hth  term  of,  179 
terms  of,  179 
unbounded,  292 
uniformly  convergent,  237 
Series 

alternating,  203 
binomial,  266 

Cauchy  product  of,  226,  233  (Exer- 
cise 4.3.40),  280  (Exercise  4.5.32) 
differentiability  of,  252 
divergent,  201 
geometric,  202 
grouping  terms  in,  220 
Maclaurin,  264 


multiplication  of,  223 
of  nonnegative  terms,  205 
partial  sums  of,  244 
power,  257 
product  of,  218 
rearrangement  of,  221 
Taylor,  223 

term  by  term  differentiation  of,  252 
term  by  term  integration  of,  25 1 
uniformly  convergent,  246 

Set 

boundary  of,  23,  289,  526 
bounded,  7,  536 

above,  3 

below,  7 
closed,  21,  289,  525 
closure  of,  23,  289,  526 
compact,  26,  293,  536 
complement  of,  20 
connected,  295 
containment  of,  19 
content  of,  485 

dense,  6,  29  (Example  1.3.22),  70 

(Exercise  2.2.10) 
denumerable,  176 
diameter  of,  292,  536 
disconnected,  295 
empty,  4 

exterior  of,  23,  289,  526 
interior  of,  21,289,  525 
nondenumerable,  176 
nonempty,  4 
open,  21,  289,  525 
singleton,  20 
strict  containment  of,  20 
subset  of,  19 
totally  bounded,  538 
unbounded  below,  7 
uniformly  bounded,  539 
universal,  19 

Sets 

disjoint,  20 
equality  of,  19 
intersection  of,  20 
nested,  529 
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union  of,  20 
Simple  zero,  108  (Exercise  2.5.5) 
Singleton  set,  20 
Singular  matrix,  370 
Solution  of  a  system  of  linear  equations 

nontrivial,  375 

trivial,  375 
Space 

metric,  518 

vector,  519 
Spherical  coordinates,  507 
Square  matrix,  368  (see  p.  369) 
Subsequence,  195 

of  a  convergent  sequence,  196,  527 
Subset,  19 

Subspace  of  a  vector  space,  519 

Successor,  11 

Sum 

of  matrices,  364 
Riemann,  1 14,  438 
lower,  120,  442 
upper,  120,  442 
Riemann-Stieltjes,  125 
of  vectors,  282 
Summation  by  parts,  218 
Supremum 

of  a  function,  60,  313 
of  a  set,  3 

existence  and  uniqueness  of,  4 
Surface,  331 

differentiable,  453 


Tangent 

to  a  curve,  75 

line,  75 

plane,  332 
Taylor  polynomial,  99,  35 1 

of  a  composite  function,  1 09  (Exer- 
cise 2.5.11) 

of  a  product,  109  (Exercise  2.5.10) 

of  a  reciprocal,  110  (Exercise  2.5. 12) 
Taylor  series,  264 

convergence  of,  264 
Taylor's  theorem 


for  functions  of  n  variables,  350 
for  a  function  of  one  variable,  104 

Terms  of  a  sequence,  179 

Term  by  term  differentiation,  252 

Term  by  term  integration,  25 1 

Test 

Cauchy's  root,  215 
comparison 

for  improper  integrals,  156 
for  series,  206 
integral,  207 
Raabe,  212 
ratio,  210 

second  derivative,  103 
Topological  properties  of  W,  282  (See 

p.  281) 
Topological  space,  26 
Total  variation,  134  (Exercise  3.2.7) 
Totally  bounded,  538 
Transformation,  362 

affine,  380 

analytic,  416  (Exercise  6.3.17) 

continuous,  379 

differentiable,  339,  379-380 

differential  of,  381 

inverse  of,  396 

invertible,  396396 

linear,  362 

one-to-one,  396 

regular,  405 
Transitivity  of  <,  31 
Transpose  of  a  matrix,  370 
Triangle  inequality,  2,  285 

in  a  metric  space,  518 
Triple  integral,  439 
Trivial  solution,  375 

U 

Unbounded 

above,  7 

below,  7 

sequence,  292 
Unconditional  divergence,  233  (Exercise  4.3.38) 
Uniform  continuity,  64,  72  (Exercises  2.2.30- 
2.2.32),  544 
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Weierstrass's  test,  246 
Z 

Zero  content,  448,  460  (Exercises  7.1.14, 
7.1.15),  461  (Exercises  7.1.16- 
7.1.19),  487,  5 14  (Exercise  7.3.2), 
515  (Exercise.  7.3.11) 

Zero 

multiplicity  of,  108  (Exercises  2.5.5— 
2.5.7) 

simple  108  (Exercise  2.5.5) 
Zeroth  derivative,  73 

of  limit,  35,305,527 
of  power  series,  263 

of  prime  factorization,  16  (Exercise  1.2.14) 

of  Riemann  integral,  125  (Exercise  3.1.1) 

of  supremum,  4 
Uniform  continuity,  64,  66,  72  (Exercises  2.2.30- 

2.2.32) 
Unit  vector,  283 
Universal  set,  19 
Upper  bound,  3 
Upper  integral,  120,  442 
Upper  sum,  120,  442 


for  functions  of  n  variables,  314,  392 
(Exercise  6.2.10) 
Uniform  convergence 

properties  preserved  by 
continuity,  242 
differentiability,  243 
integrability,  242 
of  a  sequence,  236 
of  a  series,  246 
Uniformly  bounded  set  in  C  [a,  b],  539 
Union  of  sets,  20 
Uniqueness 


V 

Value 

of  a  function,  31,  32 

local  maximum,  80 

local  minimum,  80 

principal,  155 
Variation,  total,  134  (Exercise  3.2.7) 
Vector,  283,519 
Vector  space,  283,519 

normed,  519 

subspace  of,  519 
Vector  sum,  282 
Vector,  unit,  283 

Vector- valued  function,  362  (see  p.  361) 
continuous,  379 
differentiable,  379-380 


W 

Weighted  average,  139 


